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1. (X) Amend the specification by inserting before the first 
line the sentence: -- This application is a continuation 
of copending application Serial No. 08/471,833, filed 
June 6, 1995, which is a divisional of copending 
application Serial No. 08/036,555 filed March 24, 1993 
(now U.S. Patent No. 5,530,555), which is a continuation- 
in-part of Serial No. 07/863,703 filed April 3, 1992 
(abandoned) , which in turn is a continuation-in-part of 
Serial No. 07/907,138 filed June 30, 1992 (abandoned), 
which is a continuation-in-part of Serial No. 07/940,389 
filed September 3, 1992 (abandoned), which is a 
continuation-in-part of Serial No. 07/965,173 filed 
October 23, 1992 (abandoned). --. 

2a. (X) Priority is hereby claimed under 35 USC 119 on the basis 

of United Kingdom Application Serial No. 

91 07566.3 filed on April 10, 1991 

2b. (X) The priority document (2a) was filed in the parent 

application Serial No. 08/036 , 555 , filed 

March 24, 1993 ; and filing of the priority document was 
acknowledged in a communication from the Patent Office on 
Nov. 29, 1995 in application Serial No. 08/471 , 833 . 

3a. ( ) Transfer the drawings from the prior application to this 
application and abandon said prior application as of the 
filing date accorded this application. A duplicate copy 
of this sheet is enclosed for filing in the prior 
application file. 

3b . ( ) New formal drawings are enclosed . 

4. (X) The prior application is assigned to Cambridge 

NeuroScience, One Kendall Square, Cambridge, MA 0213 9 and 
Ludwig Institute for Cancer Research, 1345 Avenue of the 
Americas, New York, N.Y. 10105. 

5. ( (X) The power of attorney in the prior application is to John 

E. Lynch, Reg. No. 20,94 0; Peter F. Felfe, Reg. No. 
20,297; Alfred H. Hemingway, Jr., Reg. No. 26,736; 
Vincent M. Fazzari, Reg. No. 26,879; Norman D. Hanson, 
Reg. No. 3 0,946; F. Brice Faller, Reg. No. 29,532; Andrew 
L. Tiajoloff, Reg. No. 31,575; and John A. Bauer, Reg. 
No. 32,554, my attorneys with full power of substitution 
and revocation. 

Address all correspondence to : FELFE & LYNCH 

805 Third Avenue 
New York, New York 10022 
(Telephone: 212-688-9200) 



6a. (X) Cancel claims 1-131 without prejudice. 
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6b. (X) Preliminary Amendment. 

6c. (X) Associate Power of Attorney. 

Respectfully submitted, 

FELFE Sc LYNCH 



B y XK&ik k^&Ot^ 

Madeline F. Baer 
Reg. No. 36,437 

805 Third Avenue 
New York, N.Y. 10022 
(212) 688-9200 

Enclosure 



(X$. Duplicate 



( ) Triplicate 
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CNS 5250.26-JEL/MFB 



N THE UNITED STATES PATENT AND TRADEMARK OFFICE 



Applicant (s) 
Serial No. 
- Filed 
For 



Andrew Goodearl et al . 

Continuation of Serial No. 08/471,833 

Concurrently herewith 

GLIAL MITOGENIC FACTORS, THEIR 
PREPARATION AND USE 



Honorable Commissioner of Patents 

and Trademarks 
Washington , D . C . 20231 

PRELIMINARY AMENDMENT 

Sir: 

Prior to examination, please amend this application as 
follows : 

IN THE SPECIFICATION 

Page 2, lines 2-6: replace with the following: 
-- This application is a continuation of copending application 
Serial No. 08/471,833, filed June 6, 1995, which is a divisional of 
copending application Serial No. 08/036,555 filed March 24, 1993 
(now U.S. Patent No. 5,53 0,555) , which is a continuation-in-part of 
Serial No. 07/863,703 filed April 3, 1992 (abandoned), which in 
turn is a continuation-in-part of Serial No. 07/907,138 filed June 
30, 1992 (abandoned) , which is a continuation-in-part of Serial No. 
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07/940,389 filed September 3, 1992 (abandoned), which is a 
continuation-in-part of Serial No. 07/965,173 filed October 23, 
1992 (abandoned). --. 

„N THE CLAIMS 

Cancel claims 1-131 without prejudice. 
Please add claims 132-140 as follows: 

132. A method for inducing myelination of a neural cell by a 
glial cell, comprising contacting said cell with an amount of a 
polypeptide which comprises an epidermal growth factor- like domain 
the amino acid sequence of which is identical to an amino acid 
sequence encoded by a GGF/pl85 erb B2 ligand gene sufficient to 
induce myelination of a neural cell by said glial cell. 

133. The method of claim 132, wherein said epidermal growth 
factor like domain comprises the amino acid sequence set forth in 
SEQ ID NO: 177. 

134. The method of claim 132, wherein said epidermal growth 
factor like domain comprises the amino acid sequence set forth in 
SEQ ID NO: 178. 
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135. The method of claim 132, wherein said epidermal growth 
factor like domain comprises the amino acid sequence set forth in 
SEQ ID NO: 42. 

136. The method of claim 133, wherein said epidermal growth 
factor like domain further comprises SEQ ID NO: 178, wherein SEQ ID 
NO: 178 is C-terminal to SEQ ID NO: 177. 

137. The method of claim 133, wherein said epidermal growth 
factor like domain further comprises SEQ ID NO: 179, wherein SEQ ID 
NO: 42 is C-terminal to SEQ ID NO: 177. 

138. The method of claim 132, wherein said epidermal growth 
factor like domain comprises an amino acid sequence selected from 
the group consisting of SEQ ID NO: 154, SEQ ID NO: 155, SEQ ID NO: 
156, SEQ ID NO: 157, SEQ ID NO: 158, and SEQ ID NO: 159. 

139. A method for inducing myelination of a neural cell by a 
glial cell, comprising contacting said cell with an amount of a 
polypeptide which binds the pl85 erb B2 receptor, sufficient to 
induce myelination of a neural cell by said glial cell. 

140. A method of inducing myelination of a neural cell by a 
glial cell, comprising contacting said glial cell with an amount of 
a recombinant polypeptide with glial cell mitogenic activity 
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sufficient to induce myelination of a neural cell by said glial 
cell. 



REMARKS 

This amendment is supported by the specification. Entry is 
requested. 

Respectfully submitted, 
FELFE & LYNCH 



B y l^feii Q 

Madeline F. Baer 
Reg. No. 36,437 

805 Third Avenue 
New York, N.Y. 10022 
(212) 688-9200 



- 4 - 




CNS 5250.26-JEL/MFB 



IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



Applicant (s) 
Serial No. 
Filed 
For 



Andrew Goodearl, et al . 

Continuation of Serial No. 08/471,833 

Concurrently herewith 

GLIAL MITOGENIC FACTORS, THEIR 
PREPARATION AND USE 



October 22, 1996 

Hon. Commissioner of Patent 

and Trademarks 
Washington, D.C. 20231 

ASSOCIATE POWER OF ATTORNEY 

Sir: 



I hereby grant and/or confirm Associate Power of Attorney to 
Peter F. Felfe, Reg. No. 20,297; Alfred H. Hemingway, Jr., Reg. No. 
26,736; Vincent M. Fazzari, Reg. No. 26,879; Norman D. Hanson, Reg. 
No. 30,946; F. Brice Faller, Reg. No. 29,532; Andrew L. Tiajoloff, 
Reg. No. 31,575; John A. Bauer, Reg. No. 32,554; Vineet Kohli, Reg. 
No. 37,003; Mary Ann Schofield, Reg. No. 36,669 and Madeline F. 
Baer, Reg. No. 3 6,43 7 to prosecute this application and to transact 
all business in the Patent and Trademark Office connected 
therewith. 




Reg. No. 2 0,94 0 

805 Third Avenue 
New York, N.Y. 10022 
(212) 688-9200 



08/736019 




Cress Reference to Related ftppH cation 
This application is a continuation-in-part of Serial 
No. 07/965,173, filed October 23, 1992, Serial No. 
07/940,389, filed September 3, 1992, Serial No. 07/907,138, 
5 filed June 30, 1992 and Serial No. 07/863,703, filed 
April 3, 1992. 

Background of the invention 
This invention relates to polypeptides found in 
vertebrate species, which polypeptides are mitogenic growth 
10 factors for glial cells, including Schwann cells. The 
invention is also concerned with processes capable of 
producing such factors, and the therapeutic application of 
such factors. 

The glial cells of vertebrates constitute the 
15 specialized connective tissue of the central and peripheral 
nervous systems. Important glial cells include Schwann 
cells which provide metabolic support for neurons and which 
provide myelin sheathing around the axons of certain 
peripheral neurons, thereby forming individual nerve fibers. 
20 Schwann cells support neurons and provide a sheath effect by 
forming concentric layers of membrane around adjacent neural 
axons, twisting as they develop around the axons. These 
myelin sheaths are a susceptible element of many nerve 
fibers, and damage to Schwann cells, or failure in growth 
25 and development, can be associated with significant 

demyelination or nerve degeneration characteristic of a 
number of peripheral nervous system diseases and disorders. 
In the development of the nervous system, it has become 
apparent that cells require various factors to regulate 
30 their division and growth, and various such factors have 
been identified in recent years, including some found to 
have an effect on Schwann cell division or development. 

- 2 - 



Thus, Brockes et al., inter alia, in J. 
Neuroscience, ± (1984) 75-83 describe a protein growth 
factor present in extracts front bovine brain and pituitary 
tissue, which was named Glial Growth Factor (GGF) . This 
5 factor stimulated cultured rat Schwann cells to divide 
against a background medium containing ten percent fetal 
calf serum. The factor was also described as having a 
molecular weight of 31,000 Daltons and as readily 
dimerizing. In Meth. Enz., 147 (1987), 217-225, Brockes 
10 describes a Schwann cell-based assay for GGF. 

Brockes et al. # supra, also describes a method of 
purification of GGF to apparent homogeneity. In brief, one 
large-scale purification method described involves 
extraction of the lyophilized bovine anterior lobes and 
15 chromatography of material obtained thereby using NaCl 

gradient elution from CM cellulose. Gel filtration is then 
carried out with an Ultrogel column, followed by elution 
from a phosphocellulose column, and finally, small-scale SDS 
gel electrophoresis. Alternatively, the CM-cellulose 
20 material was applied directly to a phosphocellulose column, 
fractions from the column were pooled and purified by 
preparative native gel electrophoresis, followed by a final 
SDS gel electrophoresis. 

Brockes et al. observe that in previously reported 
25 gel filtration experiments (Brockes et al., J. Biol. Chem. 
255 (1980) 8374-8377), the major peak of growth factor 
activity was observed to migrate with a molecular weight of 
56,000 Daltons, whereas in the first of the above-described 
procedures activity was predominantly observed at molecular 
30 weight 31,000. It is reported that the GGF dimer is largely 
removed as a result of the gradient elution from 
CM-cellulose in this procedure. 
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Benveniste et al. (PNAS, SZ (1985), 3930-3934) 
describe a T lymphocyte-derived glial growth promoting 
factor. This factor, under reducing conditions, exhibits a 
change in apparent molecular weight on SDS gels. 

Kimura et al. (Nature, 21& (1990), 257-260) describe 
a factor they term Schwannoma-derived growth factor (SDGF) 
which is obtained from a sciatic nerve sheath tumor. The 
authors state that SDGF does not stimulate the incorporation 
of tritium-labelled TdR into cultured Schwann cells under 
conditions where, in contrast, partially purified pituitary 
fraction containing GGF is active. SDGF has an apparent 
molecular weight of between 31,000 and 35,000. 

Davis and Stroobant (J. Cell. Biol., H£ (1990), 
1353-1360) describe the screening of a number of candidate 
mitogens. Rat Schwann cells were used, the chosen candidate 
substances being examined for their ability to stimulate DNA 
synthesis in the Schwann cells in the presence of 10% FCS 
(fetal calf serum), with and without forskolin. One of the 
factors tested was GGF-carboxymethyl cellulose fraction 
(GGF-CM) , which was mitogenic in the presence of FCS, with 
and without forskolin. The work revealed that in the 
presence of forskolin, inter alia, platelet derived growth 
factor (PDGF) was a potent mitogen for Schwann cells, PDGF 
having previously been thought to have no effect on Schwann 
cells. 

Holmes et al. Science (1992) 256: 1205 and Wen et 
al. Cell (1992) §±i 559 demonstrate that DNA sequences which 
encode proteins binding to a receptor (pl85 erbB2 ) are 
associated with several human tumors. 

The pl85 erbB2 protein is a 185 kilodalton membrane 
spanning protein with tyrosine kinase activity. The protein 
is encoded by the erbB2 proto-oncogene (Yarden and Ullrich 
Ann. Rev. Biochem. Sit 443 (1988)). The erbB2 gene, also 



referred to as HER-2 (in human cells) and neu (in rat 
cells) , is closely related to the receptor for epidermal 
growth factor (EGF) . Recent evidence indicates that 
proteins which interact with (and activate the kinase of) 
5 pl85 erbB2 induce proliferation in the cells bearing pl85* rbB2 
K (Holmes et al. Science £5j&: 1205 (1992); Dobashi et al. 
Proc. Natl. Acad. Sci. ££_: 8582 (1991); Lupu et al. Proc. 
Natl. Acad. Sci. ££: 2287 (1992)). Furthermore, it is 
evident that the gene encoding pl85 erbB2 binding proteins 

10 produces a number of variably-sized, differentially-spliced 
RNA transcripts that give rise to a series of proteins, 
which are of different lengths and contain some common 
peptide sequences and some unique peptide sequences. This 
is supported by the differentially-spliced RNA transcripts 

15 recoverable from human breast cancer (HDA-MB-231) (Holmes et 
al. Science 253.: 1205 (1992) ). Further support derives 
from the wide size range of proteins which act as (as 
disclosed herein) ligands for the pl85 erbB2 receptor (see 
below) . 

20 Summary of the Invention 

In general the invention provides methods for 
stimulating glial cell (in particular, Schwann cell and glia 
of the central nervous system) mitogenesis, as well as new 
proteins exhibiting such glial cell mitogenic activity. In 

25 addition, DNA encoding these proteins and antibodies which 
bind these and related proteins are provided. 

The novel proteins of the invention include 
alternative splicing products of sequences encoding known 
polypeptides. Generally, these known proteins are members 

30 of the GGF/pl85 erbB2 family of proteins. 
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Specifically, the invention provides polypeptides of 
a specified formula, and DNA sequences encoding those 
polypeptides. The polypeptides have the formula 



WYBAZCX 

5 wherein WYBAZCX is composed of the amino acid 

sequences shown in Figure 31 (SEQ ID Nos. 136-139, 141-147, 
/ 160, 161); wherein w comprises the polypeptide segment F, or 
is absent; wherein Y comprises the polypeptide segment E, or 
is absent; wherein Z comprises the polypeptide segment G or 

10 is absent; and wherein X comprises the polypeptide segments 
C/D HKL, C/D H, C/D HL, C/D D, C/D' HL, C/D' HKL, C/D' H, 
C/D' D, C/D C/D' HKL, C/D C/D' H, C/D C/D' HL, C/D C/D' D, 
C/D D' H, C/D D' HL, C/D D' HKL, C/D' D' H, C/D' D' HL, C/D' 
D' HKL, C/D C/D' D' H, C/D C/D' D' HL, or C/D C/D' D' HKL; 

15 provided that, either 

a) at least one of F, Y, B, A, 2, C, or X is of 
bovine origin; or 

b) Y comprises the polypeptide segment E; or 

c) X comprises the polypeptide segments C/D HKL, C/D 
20 D, C/D' HKL, C/D C/D' HKL, C/D C/D' D, C/D D' H, C/D D' HL, 

C/D D' HKL, C/D' D' H, C/D' D' HKL, C/D C/D' D' H, C/D C/D' 
D' HL, C/D C/D' D' HKL, C/D'H, C/D C/D'H, or C/D C/D' HL. 

In addition, the invention includes the DNA sequence 
comprising coding segments S 'FBA 3 ' as well as the with 
25 corresponding polypeptide segments having the amino acid 
sequences shown in Figure 31 (SEQ ID Nos. 136, 138, 139); 

the DNA sequence comprising the coding segments 
5 'FBA' 3# as well as the corresponding polypeptide segments 
having the amino acid sequences shown in Figure 31 (SEQ ID 
30 NOS. 136, 138, 140); 

the DNA sequence comprising the coding segments 
5# FEBA 3 ' as well as the corresponding polypeptide segments 



having the amino acid sequences shown in Figure 31 (SEQ ID 

Nos. 136-139); 

the DNA sequence comprising the coding segments 

5 *F£BA' 3, as veil as the corresponding polypeptide segments 
5 having the amino acid sequences shown in Figure 31 (SEQ ID 

Nos. 136-138, 140); and 

the DNA sequence comprising the polypeptide coding 

segments of the GGF2HBS5 cDNA clone (ATCC Deposit No. 75298, 

deposited September 2, 1992). 
10 The invention further Includes peptides of the 

formula FBA, FEB A, FBA' FEBA' and DNA sequences encoding 

these peptides wherein the polypeptide segments correspond 

to amino acid sequences shown in Figure 31, SEQ ID Nos. 

(136, 138 and 139), (136-139') and (136, 138 and 140^ and 
15 (136-138 and 140) respectively. The purified GGF-II 

polypeptide (SEQ ID No. 167) is also included as a part of 

the invention. 

Further included as an aspect of the invention are 

peptides and DNA encoding such peptides which are useful for 
20 the treatment of glia and in particular oligodendrocytes, 

microglia and astrocytes, of the central nervous system and 

methods for the administration of these peptides. 

The invention further includes vectors including DNA 

sequences which encode the amino acid sequences, as defined 
25 above. Also included are a host cell containing the 

isolated DNA encoding the amino acid sequences, as defined 

above. The invention further includes those compounds which 

bind the pl85 erbB2 receptor and stimulate glial cell 

mitogenesis in vivo and/or in vitro. 
30 Also a part of the invention are antibodies to the 

novel peptides described herein. In addition, antibodies to 

any of the peptides described herein may be used for the 

purification of polypeptides described herein. The 



antibodies to the polypeptides may also be used for the 
therapeutic inhibition of glial cell mitogenesis. 

The invention further provides a method for 
stimulating glial cell aitogenesis comprising contacting 
5 glial cells vith a polypeptide defined by the formula 

WYBAZCX 

wherein WYBAZCX is composed of the polypeptide 
segments shown in Figure 31 (SEQ ID Nos. 136-139, 141-147, 
160, 161); wherein W comprises the polypeptide segment F, or 

10 is absent wherein Y comprises the polypeptide segment E, or 
is absent; wherein Z comprises the polypeptide segment G or 
is absent; and wherein X comprises the polypeptide segment 
C/D HKL, C/D H, C/D HL, C/D D, C/D' HL, C/D' HKL, C/D' H, 
C/D' D, C/D C/D' HKL, C/D C/D' H, C/D C/D' HL, C/D C/D' D, 

15 C/D D' H, C/D D' HL, C/D D' HKL, C/D' D' H, C/D' D' HL, C/D' 
D' HKL, C/D C/D' D' H, C/D C/D' D' HL, or C/D C/D' D' HKL. 

The invention also includes a method for the 
preparation of a glial cell mitogenic factor which consist 
of culturing modified host cells as defined above under 

20 conditions permitting expression of the DNA sequences of the 
invention. 

The peptides of the invention can be used to make a 
pharmaceutical or veterinary formulation for pharmaceutical 
or veterinary use. Optionally, the formulation may be used 

25 together with an acceptable diluent, carrier or excipient 
and/or in unit dosage form. 

A method for stimulating mitogenesis of a glial cell 
by contacting the glial cell with a polypeptide defined 
above as a glial cell mitogen in vivo or in vitro is also an 

30 aspect of the invention. A method for producing a glial 

cell mitogenic effect in a vertebrate (preferably a mammal, 
more preferably a human) by administering an effective 



amount of a polypeptide as defined is also a component of 
the invention. 

Methods for treatment of diseases and disorders 
using the polypeptides described are also a part of the 
5" invention. For instance, a method of treatment or 
prophylaxis for a nervous disease or disorder can be 
effected vith the polypeptides described* Also included are 
a method for the prophylaxis or treatment of a 
pathophysiological condition of the nervous system in which 
10 a cell type is involved which is sensitive or responsive to 
a polypeptide as defined are a part of the invention. 

Included in the invention as well, are methods for 
treatment when the condition involves peripheral nerve 
damage; nerve damage in the central nervous system; 
15 neurodegenerative disorders; demyelination in peripheral or 
central nervous system; or damage or loss of Schwann cells 
oligodendrocytes, microglia, or astrocytes. For example a 
neuropathy of sensory or motor nerve fibers; or the 
treatment of a neurodegenerative disorder are included. In 
20 any of these cases, treatment consists of administering an 
effective amount of the polypeptide. 

The invention also includes a method for inducing 
neural regeneration and/or repair by administering an 
effective amount of a polypeptide as defined above. Such a 
25 medicament is made by administering the polypeptide with a 
pharmaceutical^ effective carrier. 

The invention includes the use of a polypeptide as 
defined above in the manufacture of a medicament. 

The invention further includes the use of a 
30 polypeptide as defined above 

-to immunize a mammal for producing antibodies, 
which can optionally be used for therapeutic or diagnostic 
purposes 
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-in a competitive assay to identify or quantify 
molecules having receptor binding characteristic*; 
corresponding to those of the polypeptide; and/or 

-for contacting a sample with a polypeptide, as 
5" mentioned above, along with a receptor capable of binding 
specifically to the polypeptide for the purpose of detecting 
competitive inhibition of binding to the polypeptide. 

-in an affinity isolation process, optionally 
affinity chromatography, for the separation of a 
10 corresponding receptor. 

The invention also includes a method for the 
prophylaxis or treatment of a glial tumor. This method 
consists of administering an effective amount of a substance 
which inhibits the binding of a factor as defined by the 
15 peptides above. • 

Furthermore, the invention includes a method of 
stimulating glial cell mitogenic activity by the application 
to the glial cell of a 

-30 kD polypeptide factor isolated from the KDA - MB 

20 231 human breast cell line; or 

-35 kD polypeptide factor isolated from the rat I-EJ 
transformed fibroblast cell line to the glial cell or 

-75 kD polypeptide factor isolated from the SKBR-3 

human breast cell line; or 
25 -44 kD polypeptide factor isolated from the rat I-EJ 

transformed fibroblast cell line; or 

-25kD polypeptide factor isolated from activated 
mouse peritoneal macrophages; or 

-45 kD polypeptide factor isolated from the KDA - MB 

30 231 human breast cell; or 

-7 to 14 kD polypeptide factor isolated from the 
ATL-2 human T-cell line to the glial cell; or 
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-25 kD polypeptide factor isolated from the bovine 
Xidney cells; or 

-42 kD polypeptide factor (ARIA) isolated from 

brains . 

5 The invention further includes a method for the use 

Of the EGFL1 , EGFL2, EGFL3, EGFL4, EGFL5, and EGFL6 
polypeptides, Figure 38 to 43 and SEQ ID Kos. 154 to 159, 
respectively, for the stimulation of glial cell mitogenesis 
in vivo and in vitro . 

10 Also included in the invention is the administration 

of the GGF-II polypeptide whose sequence is shown in Figure 
45 for the stimulation of glial cell mitogenesis. 

An additional aspect of the invention includes the 
use of the above-referenced peptides for the purpose of 

15 stimulating Schwann cells to produce growth factors which 
may, in turn, be harvested for scientific or therapeutic 
use. 

Furthermore, the peptides described herein may be 
used to induce central glial proliferation and remyelination 
20 for treatment of diseases, e.g., MS, where re-myelination is 
desired. 

In an additional aspect of the invention, the novel 
polypeptides described herein may be used to stimulate the 
synthesis of acetylcholine receptors. 

25 As mentioned above, the invention provides new glial 

growth factors from mammalian sources, including bovine and 
human, which are distinguished from known factors. These 
factors are mitogenic for Schwann cells against a background 
of fetal calf plasma (FCP) . The invention also provides 

30 processes for the preparation of these factors, and an 
improved method for defining activity of these and other 
factors. Therapeutic application of the factors is a 
further significant aspect of the invention. 
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Thus, important aspects of the invention are: 
(a) a basic polypeptide factor having glial cell 
mitogenic activity, more specifically, Schwann cell 
mitogenic activity in the presence of fetal calf plasma, a 
5 molecular weight of from about 30 kD to about 36 JcD, and 
including within its amino acid sequence any one or sore of 
the following peptide sequences: 



FKGDAHTE 

ASLADEYEYKXK 
10 TETSSSGLXLK 

ASLADEYEYMRK 

AGYFAEXAR 

TTEMASEQGA 

AKEALAALK 
15 FVLQAKK 

ETQPDPGQILKKVPMVIGAYT 

EYKCLKFKWFKKATVM 

£ X K F Y V P 

KLEFLXAK; and 



20 (b) a basic polypeptide factor which stimulates 

glial cell mitogenesis, particularly the division of Schwann 
cells, in the presence of fetal calf plasma, has a molecular 
weight of from about 55 IcD to about 63 kD, and including 
within its amino acid sequence any one or more of the 

25 following peptide sequences: 

VHQVWAAK 
YIFFKEPEAXSSG 
LGAWGPPAFPVXY 
WFVVIEGK 
30 ASPVSVGSVQELQR 
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VCLLTVAALPPT 
KVHQVWAAK 
KASLADSGEYMXK 
D L L L X V 

EGKVHPQRRCALDRK 

PSCGRLKEDSRYIFFHE 

ELHRKKKPQKIKIQKK 

The novel peptide sequences set out above, derived 
from the smaller molecular weight polypeptide factor, and 
from the larger molecular weight polypeptide factor, are 
also aspects of this invention in their own right. These 
sequences are useful as probe sources for polypeptide 
factors of the invention, for investigating, isolating or 
preparing such factors (or corresponding gene sequences) 
from a range of different species, or preparing such factors 
by recombinant technology, and in the generation of 
corresponding antibodies, by conventional technologies, 
preferably monoclonal antibodies, which are themselves 
useful investigative tools and are possible therapeutics. 
The invention also includes an isolated glial cell mitogenic 
activity encoding gene sequence, or fragment thereof, 
obtainable by the methods set out above for the novel 
peptide sequences of the invention. 

The availability of short peptides from the highly 
purified factors of the invention has enabled additional 
sequences to be determined (see Examples to follow). 

Thus, the invention further embraces a polypeptide 
factor having glial cell. mitogenic activity and including an 
amino acid sequence encoded by: 

(a) a DNA sequence shown in any one of Figures 28a, 
28b or 28c, SEQ ID Nos. 133-135, respectively; 



(b) a DNA sequence shown in Figure 22, SEQ ID No. 

89; 

(c) the DNA sequence represented by nucleotides 
281-557 of the .sequence shown in Figure 28a, SEQ ID Ho. 133; 
or 

(d) a DNA sequence hybridizable to any one of the 
DNA sequences according to (a) , (b) or (c) • 

The invention further includes sequences which have 
greater than 60%, preferably 80%, sequence identity of 
homology to the sequences indicated above. 

Khile the present invention is not limited to a 
particular set of hybridization conditions, the following 
protocol gives general guidance which Bay, if desired, be 
followed: 

DNA probes may be labelled to high specific activity 
(approximately 10 8 to 10 9 32 Pdmp/Mg) by nick-translation or 
by PCR reactions according to Schowalter and Sommer (Anal. 
Biochem., 177:90-94, 1989) and purified by desalting on 
G-150 Sephadex columns. Probes may be denatured (10 minutes 
in boiling water followed by immersion into ice water) , then 
added to hybridization solutions of 80% buffer B (2g 
polyvinylpyrolidine, 2g Ficoll-400, 2g bovine serum albumin, 
50ml 1 M Tris HCL (pH 7.5), 58g NaCl, lg sodium 
pyrophosphate, lOg sodium dodecyl sulfate, 950ml H 2 0) 
containing 10% dextran sulfate at 10 6 dpm 32 P per ml and 
incubated overnight (approximately 16 hours) at 60' C. The 
filters may then be washed at 60' C, first in buffer B for 15 
minutes followed by three 20-minute washes in 2X BSC, 0.1% 
SDS then one for 20 minutes in lx BSC, 0.1% SDS. 

In other respects, the invention provides: 
(a) a basic polypeptide factor which has, if 
obtained from bovine pituitary material, an observed 
molecular weight, whether in reducing conditions or not, of 



from about 30kD to about 36kD on SDS-polyacrylamide gel 
electrophoresis using the following molecular weight 
standards : 

Lysozyme (hen egg white) 14,400 
S Soybean trypsin inhibitor 21 , 500 

Carbonic anhydrase (bovine) 31,000 
Ovalbumin (hen egg white) 45,000 
Bovine serum albumin 66,200 
Phosphorylase B (rabbit muscle) 97,400; 

10 which factor has glial cell mitogenic activity including 
stimulating the division of rat Schwann cells in the 
presence of fetal calf plasma, and when isolated using 
reversed-phase HPLC retains at least 50% of said activity 
after 10 weeks incubation in 0*1% trif luoroacetic acid at 

15 4*C; and 

(b) a basic polypeptide factor which has, if 
obtained from bovine pituitary material, an observed 
molecular weight, under non-reducing conditions, of from 
about 55 kD to about 63 kD on SDS-polyacrylamide gel 

20 electrophoresis using the following molecular weight 
standards: 



Lysozyme (hen egg white) 14,400 

Soybean trypsin inhibitor 21,500 

Carbonic anhydrase (bovine) 31,000 

25 Ovalbumin (hen egg white) 45,000 

Bovine serum albumin 66 , 200 

Phosphorylase B (rabbit muscle) 97,400; 



which factor the human equivalent of which is encoded by DNA 
clone GGF2HBS5 described herein and which factor has glial 
30 cell mitogenic activity including stimulating the division 
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of rat Schwann cells in the presence of fetal calf plasma, 
and when isolated using reversed-phase HPLC retains at least 
50% of the activity after 4 days incubation in 0.1% 
trif luoroacetic acid at 4 # C. 
5 For convenience of description only, the lower 

molecular weight and higher molecular weight factors of this 
invention are referred to hereafter as "GGF-I" and "GGF-II", 
respectively. The *GGF2" designation is used for all clones 
isolated with peptide sequence data derived from GGF-II 

10 protein (i.e., GGF2HBS5, GGF2BPP3) . 

It will be appreciated that the molecular weight 
range limits quoted are not exact, but are subject to slight 
variations depending upon the source of the particular 
polypeptide factor. A variation of, say, about 10% would 

15 not, for example, be impossible for material from another 
source. 

Another important aspect of the invention is a DNA 
sequence encoding a polypeptide having glial cell mitogenic 
activity and comprising: 
20 (a) a DNA sequence shown in any one of Figures 28a, 

28b or 28c, SEQ ID Nos. 133-135: 

(b) a DNA sequence shown in Figure 22, SEQ ID No. 

89; 

(c) the DNA sequence represented by nucleotides 

25 2B1-557 of the sequence shown in Figure 28a, SEQ ID No. 133; 
or 

(d) a DNA sequence hybridizable to any one of the 
DNA sequences according to (a) , (b) or (c) . 

Another aspect of the present invention uses the 
30 fact that the Glial Growth Factors and pl85 9rbB2 ligand 
proteins are encoded by the same gene. A variety of 
messen ger RNA splicing variants (and their resultant 
proteins) are derived from this gene and many of these 
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products show pl85 erbB2 binding and activation. Several of 
the (GGF-II) gene products have been used to show Schwann 
cell mitogenic activity. This invention provides a use for 
all of the known products of the GGF/pl85 erbB2 ligand gene 
5, (described in the references listed above) as Schwann cell 
mitogens. 

This invention also relates to other, not yet 
naturally isolated splicing variants of the Glial Growth 
factor gene. Figure 30, shows the known patterns of splicing 

10 derived from polymerase chain reaction experiments (on 
reverse transcribed UNA) and analysis of cDNA clones (as 
presented within) and derived from what has been published 
as sequences encoding pl85 erbB2 ligands (Peles et al., Cell 
££:205 (1992) and Wen et al. r Cell 1£:559 (1992)). These 

15 patterns, as well as additional ones disclosed herein, 
represent probable splicing variants which exist. Thus 
another aspect of the present invention relates to the 
nucleotide sequences encoding novel protein factors derived 
from this gene. The invention also provides processes for 

20 the preparation of these factors. Therapeutic application 
of these new factors is a further aspect of the invention. 

Thus other important aspects of the invention are : 

(a) A series of human and bovine polypeptide 
factors having glial cell mitogenic activity including 

25 stimulating the division of Schwann cells. These peptide 
sequences are shown in Figures 31, 32, 33 and 34, SEQ ID 
Nos. 136-137, respectively. 

(b) A series of polypeptide factors having glial 
cell mitogenic activity including stimulating the division 

30 of Schwann cells and purified and characterized according to 
the procedures outlined by Lupu et al. Science 2j&£: 1552 
(1990) ; Lupu et al. Proc. Natl. Acad. Sci USA ££: .2287 
(1992); Holmes et al. Science 2J£: 1205 (1992); Peles et al. 
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££: 205 (1992); Yarden and Peles Biochemistry ifi: 3543 
(1991) ; Dobashi et al. Proc. Natl. Acad. Sci. 8582 
(1991); Davis et al. Biochem. Biophys. Res. Commun. 179 : 
1536 (1991); Beaumont et al., patent application 
-5 PCT/US91/03443 (1990); Greene et al. patent application 
PCT/US91/02331 (1990); Usdin and Fischbach, J. Cell. Biol. 
103:493-507 (1986); Falls et al., Cold Spring Harbor Symp. 
Quant. Biol. 55:397-406 (1990); Harris et al., Proc. Natl. 
Acad. Sci. USA 88:7664-7668 (1991); and Falls et al., Cell 

10 72:801-815 (1993). 

(c) A polypeptide factor (GGFBPP5) having glial 
cell mitogenic activity including stimulating the division 
of Schwann cells. The amino acid sequence is shown in 
Figure 32, SEQ ID No. 148, and is encoded by the bovine DNA 

15 sequence shown in Figure 32, SEQ ID No. 148. 

The novel human peptide sequences described above 
and presented in Figures 31, 32, 33 and 34, SEQ ID Nos. 136< 
150, respectively, represent a series of splicing variants 
which can be isolated as full length complementary DNAs 

20 (cDNAs) from natural sources (cDNA libraries prepared from 
the appropriate tissues) or can be assembled as DNA 
constructs with individual exons (e.g., derived as separate 
exons) by someone skilled in the art. 

Other compounds in particular, peptides, which bind 

25 specifically to the pl85 erbB2 receptor can also be used 
according to the invention as a glial cell mitogen. A 
candidate compound can be routinely screened for pl85* rbB2 
binding, and, if it binds, can then be screened for glial 
cell mitogenic activity using the methods described herein. 

30 The invention includes any modifications or 

equivalents of the above polypeptide factors which do not 
exhibit a significantly reduced activity. For example, 
modifications in which amino acid content or sequence is 
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altered without substantially adversely affecting activity 
are included. By way of illustration, in EP-A 109748 
mutations of native proteins are disclosed in which the 
possibility of unwanted disulfide bonding is avoided by . 
5 replacing any cysteine in the native sequence which is not 
- necessary for biological activity with a neutral amino acid. 
The statements of effect and use contained herein are 
therefore to be construed accordingly, with such uses and 
effects employing modified or equivalent factors being part 
10 of the invention. 

The new sequences of the invention open up the 
benefits of recombinant technology. The invention thus also 
includes the following aspects: 

(a) DNA constructs comprising DNA sequences as 
15 defined above in operable reading frame position within 

vectors (positioned relative to control sequences so as to 
permit expression of the sequences) in chosen host cells 
after transformation thereof by the constructs (preferably 
the control sequence includes regulatable promoters, e.g. 
20 Trp) . It will be appreciated that the selection of a 

promoter and regulatory sequences (if any) are matters of 
choice for those of skill in the art; 

(b) host cells modified by incorporating constructs 
as defined in (a) immediately above so that said DNA 

25 sequences may be expressed in said host cells - the choice 
of host is not critical, and chosen cells may be prokaryotic 
or eukaryotic and may be genetically modified to incorporate 
said constructs by methods known in the art; and, 

(c) a process for the preparation of factors as 

30 defined above comprising cultivating the modified host cells 
under conditions permitting expression of the DNA sequences. 
These conditions can be readily determined, for any 
particular embodiment, by those of skill in the art of 
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recombinant DNA technology. Glial cell mitogens prepared by 
this means are included in the present invention. 

Hone of the factors described in the art has the 
combination of characteristics possessed by the present new 
5 polypeptide factors. 

As indicated, the Schwann cell assay used to 
characterize the present factors employs a background of 
fetal calf plasma. In all other respects, the assay can be 
the same as that described by Brockes et al. in Meth. Enz., 
10 supra, but with 10% FCP replacing 10% FCS. This difference 
in assay techniques is significant, since the absence of 
platelet-derived factors in fetal calf plasma (as opposed to 
serum) enables a more rigorous definition of activity on 
Schwann cells by eliminating potentially spurious effects 
15 from some other factors. 

The invention also includes a process for the 
preparation of a polypeptide as defined above, extracting 
vertebrate brain material to obtain protein, subjecting the 
resulting extract to chromatographic purification by 
20 hydroxylapatite HPLC and then subjecting these fractions to 
SDS-polyacrylamide gel electrophoresis. The fraction which 
has an observed molecular weight of about 30kD to 36 kD 
and/ or the fraction which has an observed molecular weight 
of about 55kD to 63 kD is collected. In either case, the 
25 fraction is subjected to SDS-polyacrylamide gel 

electrophoresis using the following molecular weight 
standards: 

Lysozyme (hen egg white) 14,400 
Soybean trypsin inhibitor 21,500 
30 carbonic anhydrase (bovine) 31,000 

Ovalbumin (hen egg white) 45,000 
Bovine serum albumin 66,200 
Phosphorylase B (rabbit muscle) 97,400 

- 20 - 



* 



in the case of the smaller molecular weight fraction, the 
SDS-polyacrylamide gel is run in non-reducing conditions in 
reducing conditions or, and in the case of the larger 
molecular weight fraction the gel is run under non-reducing 
5 conditions. The fractions are then tested for activity 
. stimulating the division of rat Schwann cells against a 
background of fetal calf plasma. 

Preferably, the above process starts by isolating a 
relevant fraction obtained by carboxymethyl cellulose 
10 chromatography, e.g. from bovine pituitary material. It is 
also preferred that hydroxy 1 apatite HPLC, cation exchange 
chromatography, gel filtration, and/or reversed-phase HPLC 
be employed prior to the SDS-Polyacrylamide gel 
electrophoresis. At each stage in the process, activity may 
15 be determined using Schwann cell incorporation of 

radioactive iododeoxyuridine as a measure in an assay 
generally as described by Brockes in Meth. Enz., supra, but 
modified by substituting 10% TCP for 10% PCS. As already 
noted, such as assay is an aspect of the invention in its 
20 own substance for CNS or PKS cell, e.g. Schwann cell, 

mitogenic effects. 

Thus, the invention also includes an assay for glial 
cell mitogenic activity in which a background of fetal calf 
plasma is employed against which to assess DNA synthesis in 
25 glial cells stimulated (if at all) by a substance under 
assay. 

Another aspect of the invention is a pharmaceutical 
or veterinary formulation comprising any factor as defined 
above formulated for pharmaceutical or veterinary use, 
30 respectively, optionally together with an acceptable 

diluent, carrier or excipient and/or in unit dosage form. 
In using the factors of the invention, conventional 
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pharmaceutical or veterinary practice may be employed to 
provide suitable formulations or compositions. 

Thus, the formulations of this invention can be 
applied to parenteral administration, for example, 
intravenous, subcutaneous, intramuscular, intraorbital, 
opthalmic, intraventricular, intracranial, intracapsular, 
intraspinal, intracisteraal, intraperitoneal, topical, 
intranasal, aerosol, scarification, and also oral, buccal, 
rectal or vaginal administration. 

The formulations of this invention may also be 
administered by the transplantation into the patient of host 
cells expressing the DNA of the instant invention or by the 
use of surgical implants which release the formulations of 

the invention. 

Parenteral formulations may be in the form of liquid 
solutions or suspensions; for oral administration, 
formulations may be in the form of tablets or capsules; and 
for intranasal formulations, in the form of powders, nasal 
drops, or aerosols. 

Methods well known in the art for making 
formulations are to be found in, for example, "Remington's 
Pharmaceutical Sciences." Formulations for parenteral 
administration may, for example, contain as excipients 
sterile water or saline, polyalkylene glycols such as 
polyethylene glycol, oils of vegetable origin, or 
hydrogenated naphthalenes, biocompatible, biodegradable 
lactide polymer, or polyoxyethylene-polyoxypropylene 
copolymers may be used to control the release of the present 
factors. Other potentially useful parenteral delivery 
systems for the factors include ethylene-vinyl acetate 
copolymer particles, osmotic pumps, implantable infusion 
systems, and liposomes. Formulations for inhalation may 
contain as excipients, for example, lactose, or may be 



aqueous solutions containing, for example, 
polyoxyethylene-9-lauryl ether, glycocholate and 
deoxycholate, or amy be oily solutions for administration in 
the form of nasal drops, or as a gel to be applied 
5 intranasally. Formulations for parenteral administration 
may also include glycocholate for buccal administration, 

t methoxysalicylate for rectal administration, or citric acid * 
for vaginal administration. 

The present factors can be used as the sole active 

10 agents, or can be used in combination with other active 
ingredients, e.g., other growth factors which could 
facilitate neuronal survival in neurological diseases, or 
peptidase or protease inhibitors. 

The concentration of the present factors in the 

25 formulations of the invention will vary depending upon a 
number of issues, including the dosage to be administered, 
and the route of administration. 

In general terms, the factors of this invention may 
be provided in an aqueous physiological buffer solution 

20 containing about 0.1 to 10% w/v compound for parenteral 

administration. General dose ranges are from about 1 mg/kg 
to about 1 g/kg of body weight per day; a preferred dose 
range is from about 0.01 mg/kg to 100 mg/kg of body weight 
per day. The preferred dosage to be administered is likely 

25 to depend upon the type and extent of progression of the 
pathophysiological condition being addressed, the overall 
health of the patient, the make up of the formulation, and 
the route of administration. 

As indicated above, Schwann cells (the glial cells 

30 of the peripheral nervous system) are stimulated to divide 
in the presence of the factors of the invention. Schwann 
cells of the peripheral nervous system are involved in 
supporting neurons and in creating the myelin sheath around 
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individual nerve fibers . This sheath is important for 
proper conduction of electrical impulses to muscles and from 
sensory receptors. 

There are a variety of peripheral neuropathies in - 
5 which Schwann cells and nerve fibers are damaged, either 
primarily or secondarily. There are many neuropathies of 
both sensory and motor fibers (Adams and Victor, Principles 
of Neurology) . The most important of those neuropathies are 
probably the neuropathies associates with diabetes, multiple 

10 sclerosis, Landry-Guillain-Barr syndrome, neuropathies 
caused by carcinomas, and neuropathies caused by toxic 
agents (some of which are used to treat carcinomas) . 

The invention, however, envisages treatment or 
prophylaxis of conditions where nervous system damage has 

15 been brought about by any basic cause, e.g. infection or 

injury. Thus, in addition to use of the present factors in 
the treatment of disorders or diseases of the nervous system 
where demyelination or loss of Schwann cells is present, 
such glial growth factors can be valuable in the treatment 

20 of disorders of the nervous system that have been caused by 
damage to the peripheral nerves. Following damage to 
peripheral nerves, the regeneration process is led by the 
growth or the re-establishment of Schwann cells, followed by 
the advancement of the nerve fibre back to its target. By 

25 speeding up the division of Schwann cells one could promote 
the regenerative process following damage. 

Similar approaches could be used to treat injuries 
or neurodegenerative disease of the central nervous system 
(brain and spinal cord) . 

30 Furthermore, there are a variety of tumors of glial 

cells the most common of which is probably 
neurofibromatosis, which is a patchy small tumor created by 
overgrowth of glial cells. Also, it has been found that an 
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activity very much like GGF can be found in some Schwann 
cell tumors, and therefore inhibitors of the action of the 
present factors on their receptors provides a therapy of a 
glial tumor, vhich comprises administering an effective 
5 amount of a substance vhich inhibits the binding of a 
factor, as defined above, to a receptor. 

In general, the invention includes the use of 
present polypeptide factors in the prophylaxis or treatment 
of any pathophysiological condition of the nervous system in 
10 vhich a f actor-sensitive or factor-responsive cell type is 
involved • 

The polypeptide factors of the invention can also be 
used as immunogens for making antibodies, such as monoclonal 
antibodies, following standard techniques. Such antibodies 

15 are included within the present invention. These antibodies 
can, in turn, be used for therapeutic or diagnostic 
purposes. Thus, conditions perhaps associated with abnormal 
levels of the factor may be tracked by using such 
antibodies. In vitro techniques can be used, employing 

20 assays on isolated samples using standard methods. Imaging 
methods in which the antibodies are, for example, tagged 
with radioactive isotopes which can be imaged outside the 
body using techniques for the art of tumour imaging Bay also 
be employed. 

25 The invention also includes the general use of the 

present factors as glial cell mitogens in vivo or in vitro, 
and the factors for such use. One specific embodiment is 
thus a method for producing a glial cell mitogenic effect in 
a vertebrate by administering an effective amount of a 

30 factor of the invention. A preferred embodiment is such a 
method in the treatment or prophylaxis of a nervous system 
disease or disorder. 
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A further general aspect of the invention is the use 
of a factor of the invention in the manufacture of a 
medicament, preferably for the treatment of a nervous 
disease or disorder, or for neural regeneration or repair* 
5* Also included in the invention are the use of the 

factors of the invention in competitive assays to identify 
or quantify molecules having receptor binding 
characteristics corresponding to those of said polypeptides. 
The polypeptides may be labelled, optionally with a 

10 radioisotope. A competitive assay can identify both 
antagonists and agonists of the relevant receptor. 

In another aspect, the invention provides the use of 
each one of the factors of the invention in an affinity 
isolation process, optionally affinity chromatography, for 

15 the separation of a respective corresponding receptor. Such 
processes for the isolation of receptors corresponding to 
particular proteins are known in the art, and a number of 
techniques are available and can be applied to the factors 
of the present invention. For example, in relation to IL-6 

20 and IFNy the reader is referred to Novick, D. ; et al., J. 
Chr omatogr . (1990) 510 ; 331-7. With respect to gonadotropin 
releasing hormone reference is made to Hazum, £*, J. (1990) 
Chromatogr. 510 :233-8. In relation to G-CSF reference is 
made to Fukunaga, R. , et al., J. Biol. Chem. , Z&Ht 13386-90. 

25 In relation to IL-2 reference is made to Smart, J.E., et 
al., (1990) J. Invest. Dermatol., £4 :158S-163S, and in 
relation to human IFN-gamma reference is made to Stefanos, 
S, (1989) J« Interferon Res., £:7l9-30. 

Brief Description of the Drawings 
30 The drawings will first be described. 

prawinos 
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Figures 1 to 8 relate to Example 1, and are briefly 
described below: 

Fig. 1 is the profile for product from carboxymethyl 
cellulose- chromatography; 
5 Fig. 2 is the profile for product from 

hydroxy lapatite HPLC; 

Fig. 3 is the profile for product from Mono S FPLC; 

Fig. 4 is the profile for product from Gel 
filtration FPLC; 
10 Figs. 5 and € depict the profiles for the two 

partially purified polypeptide products from reversed-phase 
HPLC; and 

Figs. 7 and 8 depict dose-response curves for the 
GGF-I and GGF-II fractions from reversed-phase HPLC using 

15 either a fetal calf serum or a fetal calf plasma background; 

Figs. 9 to 12 depict the peptide sequences derived 
from GGF-I and GGF-II, SEQ ID Nos. 1-20, 22-29, 32-53 and 
169, (see Example 2 hereinafter), Figures 10 and 12 
specifically depict novel sequences: 

20 In Fig. 10, Panel A, the sequences of GGF-I peptides 

used to design degenerate oligonucleotide probes and 
degenerate PCR primers are listed (SEQ ID Nos. 20; '1, 22-29, 
and 17). Some of the sequences in Panel A were also used to 
design synthetic peptides. Panel B is a listing of the 

25 sequences of novel peptides that were too short (less than 6 
amino acids) for the design of degenerate probes or 
degenerate PCR primers (SEQ ID Kos. 17 and 52); 

In Fig. 12, Panel A, is a listing of the sequences 
of GGF-II peptides used to design degenerate oligonucleotide 

30 probes and degenerate PCR primers (SEQ ID Kos. 45-52). Some 
of the sequences in Panel A were used to design synthetic 
peptides. Panel B is a listing of the novel peptide that 



was too short (less than 6 amino acids) for the design of 
degenerate probes or degenerate PCR primers (SEQ ID No. 53); 

Figures 13 to 20 relate to Example 3, below and 
depict the mitogenic activity of factors of the invention; 
? % Figures 21 to 28 (a, b and c) relate to Example 4, 

below and are briefly described below: 

Fig. 21 is a listing of the degenerate 
oligonucleotide probes (SEQ ID Nos. 54-88) designed from the 
novel peptide sequences in Figure 10, Panel A and Figure 12, 

10 Panel A; 

Fig. 22 (SEQ ID No. 89) depicts a stretch of the 
putative bovine GGF-II gene sequence from the recombinant 
bovine genomic phage GGF2BG1, containing the binding site of 
degenerate oligonucleotide probes 609 and 650 (see Figure 

15 21, SEQ ID NOS. 69 and 72, respectively). The figure is the 
coding strand of the DNA sequence and the deduced amino acid 
sequence in the third reading frame. The sequence of 
peptide 12 from factor 2 (bold) is part of a 66 amino acid 
open reading frame (nucleotides 75272) ; 

20 Fig. 23 is the degenerate PCR primers (Panel A, SEQ 

IS Nos. 90-108) and unique PCR primers (Panel B, SEQ ID Nos. 
109-119) used in experiments to isolate segments of the 
bovine GGF-II coding sequences present in RNA from posterior 
pituitary; 

25 Fig. 24 depicts of the nine distinct contiguous 

bovine GGF-II cDNA structures and sequences that were 
obtained in PCR amplification experiments using the list of 
primers in Figure 7, Panels A and B, and RNA from posterior 
pituitary. The top line of the Figure is a schematic of the 

30 coding sequences which contribute to the cDNA structures 
that were characterized; 

Fig. 25 is a physical map of bovine recombinant 
phage of GGF2BG1. The bovine fragment is roughly 20 kb in 
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length and contains two exons (bold) of the bovine GGF-II 
gene. Restriction sites for the enzymes Xbal, Spel, Ndel, 
EcoRI, Kpnl, and SstI have been placed on this physical sap. 
Shaded portions correspond to fragments which were subcloned 
5 for sequencing; 

Fig* 26 is a schematic of the structure of three 
alternative gene products of the putative bovine GGF-II 
gene* Exons are listed A through £ in the order of their 
discovery* The alternative splicing patterns 1, 2 and 3 

10 generate three overlapping deduced protein structures 

(GGF2BPP1, 2, and 3), which are displayed in the various 
Figures 28a, b, c (described below); 

rig. 27 (SEQ ID Nos. 120-132) is a comparison of the 
GGF-I and GGF-II sequences identified in the deduced protein 

15 sequences shown in Figures 28a, 28b and 28c (described 

below) with the novel peptide sequences listed in Figures 10 
and 12. The Figure shows that six of the nine novel GGF-II 
peptide sequences are accounted for in these deduced protein 
sequences* Two peptide sequences similar to GGF-I sequences 

20 are also found; 

Fig* 28a (SEQ ID No. 133) is a listing of the coding 
strand DNA sequence and deduced amino acid sequence of the 
cDNA obtained from splicing pattern number 1 in Figure 26. 
This partial cDNA of the putative bovine GGF-II gene encodes 

25 a protein of 206 amino acids in length. Peptides in bold 

were those identified from the lists presented in Figures 10 
and 12. Potential glycosylation sites are underlined (along 
with polyadenylation signal AATAAA) ; 

Fig. 28b (SEQ ID No. 134) is a listing of the coding 

30 strand DNA sequence and deduced amino acid sequence of the 
cDNA obtained from splicing pattern number 2 in Figure 26. 
This partial cDNA of the putative bovine GGF-II gene encodes 
a protein of 281 amino acids in length. Peptides in bold 
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are those identified from the lists presented in Figures 10 
and 12. Potential glycosylation sites are underlined (along 
vith polyadenylation signal AATAAA) ; 

Tig. 2Bc (SEQ ID No. 135) is a listing of the coding 
5 strand DNA sequence and deduced amino acid sequence of the 
cDNA obtained from splicing pattern number 3 in Figure 26. 
This partial cDNA of the putative bovine GGF-II gene encodes 
a protein of 257 amino acids in length. Peptides in bold 
are those identified from the lists in Figures 10 and 12. 

10 Potential glycosylation sites are underlined (along vith 
polyadenylation signal AATAAA) . 

Fig. 29, vhich relates to Example 6 hereinafter, is 
an autoradiogram of a cross hybridization analysis of 
putative bovine GGF-XI gene sequences to a variety of 

15 mammalian DNAs on a southern blot. The filter contains 

lanes of EcoRI -digested DNA (5 Mg per lane) from the species 
listed in the Figure. The probe detects a single strong 
band in each DNA sample, including a four kilobase fragment 
in the bovine DNA as anticipated by the physical map in 

20 Figure 25. Bands of relatively minor intensity are observed 
as veil, vhich could represent related DNA sequences. The 
strong hybridizing band from each of the other mammalian DNA 
samples presumably represents the GGF-II homologue of those 
species. 

25 Fig* 30 is a diagram of representative splicing 

variants. The coding segments are represented by F, E # B, 
A, G, C, C/D, C/D', D, D' # H, K and L. The location of the 
peptide sequences derived from purified protein are 
indicated by f, o w . 

30 Fig. 31 (SEQ ID Nos. 136-147, 160, 161) is a listing 

of the DNA sequences and predicted peptide sequences of the 
coding segments of GGF. Line 1 is a listing of the 
predicted amino acid sequences of bovine GGF, line 2 is a 



listing of the nucleotide sequences of bovine GGF, line 3 is 
a listing of the nucleotide sequences of human GGF 
(heregulin) (nucleotide base matches are indicated with a 
vertical line) and line 4 is a listing of the predicted • 
amino acid sequences of human GGF/heregulin vhere it differs 
from the predicted bovine sequence. Coding segments E, A' 
and K represent only the bovine sequences. Coding segment 
D' represents only the human (heregulin) sequence. 

Fig. 32 (SEQ ID Ho. 148) is the predicted GGF2 amino 
acid sequence and nucleotide sequence of BPP5. The upper 
line is the nucleotide sequence and the lower line is the 
predicted amino acid sequence. 

Fig. 33 (SEQ ID No. 149) is the predicted amino acid 
sequence and nucleotide sequence of GGF2BPP2. The upper 
line is the nucleotide sequence and the lower line is the 
predicted amino acid sequence. 

Fig. 34 (SEQ ID No. 150) is the predicted amino acid 
sequence and nucleotide sequence of GGF2BPP4. The upper 
line is the nucleotide sequence and the lower line is the 
predicted amino acid sequence. 

Fig. 35 (SEQ ID Nos. 151-152) depicts the alignment 
of two GGF peptide sequences (GGF2bpp4 and GGF2bpp5) with 
the human EGF (hEGF) . Asterisks indicate positions of 
conserved cysteines. 

Fig. 36 depicts the level of GGF activity (Schwann 
cell autogenic assay) and tyrosine phosphorylation of a ca. 
200kD protein (intensity of a 200 kD band on an 
autoradiogram of a Western blot developed with an 
antiphosphotyrosine polyclonal antibody) in response to 
increasing amounts of GGF. 

Fig. 37 is a list of splicing variants derived from 
the sequences shown in Figure 31. 
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Fig. 38 is the predicted amino acid sequence, 
bottom, and nucleic sequence, top, of EGFL1 (SEQ ID No. 

154) . 

Fig. 39 is the predicted amino acid sequence, 
5 bottom, and nucleic sequence, top, of EGFL2 (SEQ ID No. 

155) . 

Fig. 40 is the predicted amino acid sequence, 
bottom, and nucleic sequence, top, of EGFL3 (SEQ ID No. 

156) . 

10 Fig. 41 is the predicted amino acid sequence, 

bottom, and nucleic sequence, top, of EGFL4 (SEQ ID No. 

157) . 

Fig. 42 is the predicted amino acid sequence, 
bottom, and nucleic sequence, top, of EGFL5 (SEQ ID No. 
15 158). 

Fig. 43 is the predicted amino acid sequence, 
bottom, and nucleic sequence, top, of EGFL6 (SEQ ID No. 
159) . 

Fig. 44 is a scale coding segment map of the clone. 
20 T3 refers to the bacteriophage promoter used to produce mRNA 
from the clone. R « flanking EcoRI restriction enzyme 
sites. 5' UT refers to the 5' untranslated region. E, B, 
A, C, C/D', and D refer to the coding segments. O * the 
translation start site. A « the 5' limit of the region 
25 homologous to the bovine E segment (see example 6) and 3' UT 
refers to the 3' untranslated region. 

Fig. 45 is the predicted amino acid sequence 
(middle) and nucleic sequence (top) of GGF2HBS5 (SEQ ID No. 
167). The bottom (intermittent) sequence represents peptide 
30 sequences derived from GGF-II preparations (see Figures 11, 
12). 
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Fig. 46 is a graph depicting the Schwann cell 
mitogenic activity of recombinant human and bovine glial 
growth factors. 

Fig. 47 is a dose-response curve depicting Schwann 
5 cell proliferation activity data resulting from 

administration of different size aliguots of CHO cell 
conditioned medium. 

rig. 48 is a dose-response curve depicting Schwann 
cell mitogenic activity secreted into the extracellular 
10 medium by SF9 insect cells infected with baculovirus 
containing the GGF2HBS5 cDNA clone. 

Pig. 49 is a Western blot of recombinant CHO cell 
conditioned medium using a GGF peptide antibody. 

Pig. 50 (A) is a graph of Schwann cell proliferation 
15 activity of recombinant (COS cell produced) human GGF-II 

(rhGGF-II) peak eluted from the cation exchange column; (B) 
is an immunoblot against recombinant GGFII peak using 
polyclonal antibody made against specific peptide of 
rhGGFII; 

20 Pig. 51 (A) is. a graph showing the purification of 

rhGGF-II (CHO cell produced) on cation exchange column by 
fraction; (B) is a photograph of a Western blot using 
fractions as depicted in (A) and a rhGGF-II specific 
antibody. 

25 pig. 52 is a photograph of a gel depicting tyrosine 

phosphorylation in Schwann cells treated with recombinant 

glial growth factors. 

Pig. 53 is the sequences of GGFHBS5 , GGFHFB1 and 

GGFBPP5 polypeptides (SEQ ID NOS: 170, 171, and 172). 
30 pig. 54 is a map of the CHO cell-expression vector 

pcDHFRpolyA. 
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petailed Description 
The invention pertains to the isolation and 
purification of novel Glial Growth factors and the cloning 
of DNA sequences encoding these factors. Other components 
5 of the invention are several gene splicing variants which 
potentially encode a series of glial growth factors, in 
particular the GGF2HBS5 in particular a variant which 
encodes the human equivalent of bovine GGF-II. It is 
evident that the gene encoding GGF's and pl85* rbB2 binding 

10 proteins produces a number of variably-sized, 

differentially-spliced RNA transcripts that give rise to a 
series of proteins, which are of different lengths and 
contain some common peptide sequences and some unique 
peptide sequences. This is supported by the differentially- 

15 spliced sequences which are recoverable from bovine 

posterior pituitary RNA (as presented herein) , human breast 
cancer (KDA-MB-231) (Holmes et al. Science 256 : 1205 (1992) 
and chicken brain RNA (Falls et al. Cell 72:1-20 (1993)). 
Further support derives from the wide si2e range of proteins 

20 which act as both mitogens for Schwann cells (as disclosed 
herein) and as ligands for the pl85 erbB2 receptor (see 
below) . 

Further evidence to support the fact that the genes 
encoding GGF and pl85 erbB2 are homologous comes from 

25 nucleotide sequence comparison. Science, 256 (1992), 

1205-1210) Holmes £t al. demonstrate the purification of a 
45-kilodalton human protein (Heregulin-a) which specifically 
interacts with the receptor protein pl85 erbB2 , which is 
associated with several human malignancies. Several 

30 complementary DNA clones encoding Heregulin-a were isolated. 
Peles et al. (Cell ££:205 (1992)) and Wen et al (Cell ££:559 
(1992)) describe a complementary DNA isolated from rat cells 
encoding a protein called "neu differentiation factor" 
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(NDF) . The translation product of the NDF cDNA has 
pl85 erbB2 b i nd i ng activity. Usdin and Fischbach, J. Cell. 
Biol. 103:493-507 (1986); Falls et al., Cold Spring Harbor 
Symp. Quant. Biol. 55:397-406 (1990); Harris et al., Proc. 
5 Natl. Acad. Sci. USA 88:7664-7668 (1991); and Falls et al. f 
Cell 72:801-815 (1993) demonstrate the purification of a 42 
Kd glycoprotein which interacts with a receptor protein 
pl85 erbB2 and several complementary cDNA clones were 
isolated (Falls et al. Cell 72:801-815 (1993). Several 

10 other groups have reported the purification of proteins of 
various molecular weights with pl85 ,rbB2 binding activity. 
These groups include Lupu et al. (1992) Proc. Natl. Acad. 
Sci. USA 89:2287; Yarden and Peles (1991) Biochemistry 
30:3543; Lupu et al. (1990) Science 249:1552); Dobashi et 

25 al. (1991) Biochem. Biophys. Res. Comm. 179:1536; and Huang 
et al. (1992) J. Biol. Chem. 257:11508-11512. 

Other Embodiments 
The invention includes any protein which is 
substantially homologous to the coding segments in Figure 31 

20 (SEQ ID Ko.s 136-147, 160, and 161) as well as other 

naturally occurring GGF polypeptides. Also included are: 
allelic variations; natural mutants; induced mutants; 
proteins encoded by UNA that hybridizes under high or low 
stringency conditions to a nucleic acid naturally occurring 

25 (for definitions of high and low stringency see Current 

Protocols in Molecular Biology, John Kiley & Sons, New York, 
1989, 6.3.1 - 6.3.6, hereby incorporated by reference); and 
polypeptides or proteins specifically bound by antisera to 
GGF polypeptide. The term also includes chimeric 

30 polypeptides that include the GGF polypeptides comprising 
sequences from Figure 31. 

The following examples are not intended to limit the 
invention, but are provided to usefully illustrate the same, 

- 35 - 



and provide specific guidance for effective preparative 
techniques. 

As will be seen from Example 3, below, the present 
factors exhibit mitogenic activity on a range of cell types. 
5 The activity in relation to fibroblasts indicates a wound 
repair ability, and the invention encompasses this use* The 
general statements of invention above in relation to 
formulations and/or medicaments and their manufacture should 
clearly be construed to include appropriate products and 

10 uses. This is clearly a reasonable expectation for the 

present invention, given reports of similar activities for 
fibroblast growth factors (FGFs)» Reference can be made, 
for example, to Sporn £t "Peptide Growth Factors and 

their Receptors I", page 396 (Baird and Bohlen) in the 

15 section headed "FGFs in Wound Healing and Tissue Repair". 

Purification of GGF-I and GGF-II from bovine Pituitaries 
1. Preparation of Factor-CK Fraction 

4,000 frozen whole bovine pituitaries (c.a. 12 kg) 

2 0 were thawed overnight, washed briefly with water and then 
homogenized in an equal volume of 0.15 K ammonium sulphate 
in batches in a Waring Blender. The homogenate was taken to 
pH 4.5 with 1.0 M HC1 and centrifuged at 4,900g for B0 
minutes. Any fatty material in the supernatant was removed 

25 by passing it through glass wool. After taking the pH of the 
supernatant to 6.5 using 1.0 H NaOH, solid ammonium sulphate 
was added to give a 36% saturated solution. After several 
hours stirring, the suspension was centrifuged at 4,900 g 
for 80 minutes and the precipitate discarded. After 

30 filtration through glass wool, further solid ammonium 
sulphate was added to the supernatant to give. a 75% 
saturated solution which was once again centrifuged at 4,900 
g for 80 minutes after several hours stirring. The pellet 



vas resuspended in c.a. 2 L of 0.1 K sodium phosphate pH 6.0 
and di&lyzed against 3 x 40 L of the same buffer. After 
confirming that the conductivity of the dialysate vas below 
20.0 ^Siemens, it vas loaded onto a Bioprocess column {120 x 
5 113 mm, Pharmacia) packed with carboxymethyl cellulose 

(CM-52, Whatman) at a flow rate of 2 ml ain" 1 . The column 
vas washed vith 2 volumes of 0.1 M sodium phosphate pH 6.0, 
followed by 2 volumes of 50 aM NaCl, and finally 2 volumes 
of 0.2 K NaCl both in the same buffer. During the final 
10 step, 10 mL (5 minute) fractions vere collected. Fractions 
73 to 118 inclusive vere pooled, dialyzed against 10 volumes 
of 10 mM sodium phosphate pH 6.0 twice and clarified by 
centrifugation at 100,000 g for 60 minutes. 

II. Hydroxy 1 apatite HPLC 

15 Hydroxy 1 apatite HPLC is not a technique hitherto 

used in isolating glial growth factors, but proved 
particularly efficacious in this invention. 
The material obtained from the above CM-cellulose 
chromatography was filtered through a 0.22 pm filter 

20 (Nalgene) , loaded at room temperature on to a high 

performance hydroxylapatite column (50 x 50 mm, Biorad) 
equipped with a guard column (15 x 25 mm, Biorad) and 
equilibrated with 10 mM potassium phosphate pH 6.0. Elution 
at room temperature vas carried out at a flow rate of 2 

25 ml. minute" 1 using the following programmed linear gradient: 

time (min) %B Solvent A: 10 mM potassium phosphate pH 6.0 



O.o 0 Solvent B: 1.0 M potassium phosphate pH 6.0 

5.0 0 

7.0 20 

30 70.0 20 

150.0 . 100 
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180.0 100 
185.0 0 

6.0 mL (3 minutes) fractions were collected during the 
gradient elution. Fractions 39-45 were pooled and dialyzed 
5 against 10 volumes of 50 mM sodium phosphate pH 6.0. 

TTT. Mono S FPLC 

Mono S FPLC enabled a more concentrated material to 
be prepared for subsequent gel filtration. 

Any particulate material in the pooled material from 
10 the hydroxylapatite column was removed by a clarifying spin 
at 100,000 g for 60 minutes prior to loading on to a 
preparative HR10/10 Mono S cation exchange column (100 x 10 
mm, Pharmacia) which was then re-equilibrated to 50mM sodium 
phosphate pH 6.0 at room temperature with a flow rate of 1.0 
15 ml /minute" 1 . Under these conditions, bound protein was 
eluted using the following programmed linear gradient: 

time (min) %B Solvent A: 50 mM potassium phosphate pH 6.0 



O.o 0 Solvent B: 1.2 M sodium chloride, 50 mm 

70.0 30 sodium phosphate pH 6.0 

20 240.0 100 

250.0 100 

260.0 0 



1 mL (l minute) fractions were collected throughout this 
gradient program. Fractions 99 to 115 inclusive were 
25 pooled. 

TV. Gel Fi H-rwtion FPLC 

This step commenced the separation of the two 
factors of the invention prior to final purification, 
producing enriched fractions.. 
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For the purposes of this step, a preparative 
Superose 12 FFLC column (510 x 20 mm, Pharmacia) was packed 
according to the manufacturers' instructions. In order to 
standardize this column, a theoretical plates measurement 
5 vas made according to the manufacturers' instructions, 
giving a value of 9,700 theoretical plates. 

The pool of Kono S eluted material vas applied at 
room temperature in 2.5 Ml aliguots to this column in 50mM 
sodium phosphate, 0.75 KaCl pH 6.0 (previously passed 

10 through a CIS reversed phase column (Sep-pak, Millipore) at 
a flow rate of 1.0 mL/minute* 1 . 1 mL (0.5 minute) fractions 
vere collected from 35 minutes after each sample vas applied 
to the column. Fractions 27 to 41 (GGF-II) and 42 to 57 
(GGF-I) inclusive from each run vere pooled. 

15 V. Reversed-Phase HPLC 

The GGF-I and GGF-II pools from the above Superose 
12 runs vere each divided into three equal aliguots. Each 
aliquot vas loaded on to a C8 reversed-phase column 
(Aguapore RP-3 00 7 ft C8 220 x 4.6 mm, Applied Biosystems) 

20 protected by a guard cartridge (RP-8, 15 x 3.2 mm, Applied 
Biosystems) and equilibrated to 40'C at 0.5 mX. minute. 
Protein vas eluted under these conditions using the 
following programmed linear gradient: 

time (min) %B Solvent A: 0.1% trif luoroacetic acid (TFA) 



25 0 Solvent B: 90% acetonitrile, 0.1% TFA 

60 66.6 

62.0 100 

72.0 100 

75.0 0 



30 200 nL (0.4 minute) fractions vere collected in siliconized 
tubes (Kultilube tubes, Bioquote) from 15.2 minutes after 
the beginning of the programmed gradient. 
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SPS-Folvacrvl&iftidg Gel Electrophoresis 

In this step, protein molecular weight standards , 
low range, catalogue no. 161-0304, from Bio-Rad Laboratories 
Limited, Katford, England were employed. The actual 
5 proteins used, and their molecular weight standards, have 
been listed herein previously. 

Fractions 47 to 53 (GGF-I) and fractions 61 to 67 
(GGFII) inclusive from the reversed-phase runs were 
individually pooled. 7 of the pooled material was boiled 
10 in an equal volume of 0.0125 M Tris-Cl, 4% SDS, 20% 

glycerol, and 10% B-mercaptoethanol for GGF-I, for 5 minutes 
and loaded on to an 11% polyacrylamide Laemmli gel with a 4% 
stacking gel and run at a constant voltage of 50 V for 16 
hours. This gel was then fixed and stained using a silver 
15 staining kit (Amersham) . Under these conditions, the 

factors are each seen as a some what diffuse band at relative 
molecular weights 30,000 to 36,000 Daltons (GGF-I) and 
55,000 to 63,000 Daltons (GGFII) as defined by molecular 
weight markers. From the gel staining, it is apparent that 
20 there are a small number of other protein species present at 
equivalent levels to the GGF-I and GGF-II species in the 
material pooled from the reversed-phase runs. 
VII. Stability in Trif luoroacetic Acid 

Stability data were obtained for the present Factors 
25 in the presence of trif luoroacetic acid, as follows:- 

GGF-I ; Material from the reversed-phase HFLC, in 
the presence of 0.1% TFA and acetonitrile, was assayed 
within 12 hours of the completion of the column run and then 
after 10 weeks incubation at 40 # C. Following incubation, 
30 the GGF-I had at least 50% of the activity of that material 
assayed directly off the column. 

GGF-II : Material from the reversed-phase HPLC, in 
the presence of 0.1% TFA and acetonitrile, and stored at 
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•20* C, was assayed after thawing and then after 4 days 
incubation at 40*C. Following incubation, the GGF-II had at 
least 50% of the activity of that material freshly thawed* 

It will be appreciated that the trif luoroacetic acid 
5 concentration used in the above studies is that most 
commonly used for reversed-phase chromatography. 

VIII - Activity Assay Conditions 

Unless otherwise indicated, all operations were 
conducted at 37 *C, and, with reference to Figures 1 to 6, 

10 activity at each stage was determined using the Brookes 
(Meth. Enz., supra) techniques with the following 
modifications. Thus, in preparing Schwann cells, 5 tM 
forskolin was added in addition to DMEM (Dulbecco's modified 
Eagle's medium) , FCS and GGF. Cells used in the assay were 

15 f ibroblast-free Schwann cells at passage number less than 
10, and these cells were removed from flasks with trypsin 
and plated into flat-bottomed 96-well plates at 3.3 thousand 
cells per microwell. 

[ 125 I]IUdR was added for the final 24 hours after the 

2 0 test solution addition. The background (unstimulated) 
incorporation to each assay was less than 100 cpm, and 
maximal incorporation was 20 to 200 fold over background 
depending on Schwann cell batch and passage number. 

In the case of the GGF-I and GGF-II fractions from 

25 reversed-phase HPLC as described above, two dose response 

curves were also produced for each factor, using exactly the 
above method for one of the curves for each factor, and the 
above method modified in the assay procedure only by 
substituting foetal calf plasma for fetal calf serum to 

30 obtain the other curve for each factor. The results are in 
Figures 7 and 8. 
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EXAMPLE 2 

ftr:ino acid sequences cf purified GGF-l and GGF-II 



Amino acid sequence analysis studies vere performed 
ttsing highly purified bovine pituitary GGF-I and GGF-II. 
5 The conventional single letter code vas used to describe the 
sequences . Peptides vere obtained by lysyl endopeptidase 
and protease VS digests, carried out on reduced and 
carboxymethylated samples, with the lysyl endopeptidase 
digest of GGF-II carried out on material eluted from the 

10 £5-65 RD region of a 11% SDS-PAGE (KW relative to the 
above-quoted markers) . 

A total of 21 peptide sequences (see Figure 9, SEQ 
ID Nos. 1-2 0, 169) vere obtained for GGF-I, of vhich 12 
peptides (see Figure 10, SEQ ID Nos. 1, 22-29, 17, 19, and 

15 32) are not present in current protein databases and 

therefore represent unique sequences, A total of 12 peptide 
sequences (see Figure 11, SEQ ID Nos. 33-44) vere obtained 
for GGF-II, of vhich 10 peptides (see Figure 12, SEQ ID Nos. 
4 5-53) are not present in current protein databases and 

20 therefore represent unique sequences (an exception is 

peptide GGF-II 06 vhich shows identical sequences in many 
proteins vhich are probably of no significance given the 
small number of residues) . These novel sequences are 
extremely likely to correspond to portions of the true amino 

25 acid sequences of GGFs I and II* 

Particular attention can be drawn to the sequences 
of GGF-I 07 and GGF-II 12, vhich are clearly highly related. 
The similarities indicate that the sequences of these 
peptides are almost certainly those of the assigned GGF 

30 species, and are most unlikely to be derived from 
contaminant proteins. 
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In addition, in peptide GGF-II 02, the sequence X S 
£ is consistent with the presence of an N linked 
carbohydrate moiety on an asparagine at the position denoted 
i>y X. 

5 In general, in Figures 9 and 11, X represents an 

unknown residue denoting a sequencing cycle where a single 
position could not be called with certainty either because 
there was more than one signal of equal size in the cycle or 
because no signal was present* As asterisk denotes those 

10 peptides where the last amino acid called corresponds to the 
last amino acid present in that peptide. In the remaining 
peptides, the signal strength after the last amino acid 
called was insufficient to continue sequence calling to the 
end of that peptide. The right hand column indicates the 

15 results of a computer database search using the GCG package 
FAST A and TFASTA programs to analyze the NBRF and EMBL 
sequence databases. The name of a protein in this column 
denotes identity of a portion of its sequence with the 
peptide amino acid sequence called allowing a maximum of two 

2 0 mismatches. A question mark denotes three mismatches 
allowed. The abbreviations used are as follows: 

HMG-1 High Mobility Group protein-1 

HMG-2 High Mobility Group protein-2 

LH-alpha Luteinizing hormone alpha subunit 

25 LH-beta Luteinizing hormone beta subunit 
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EXAMPLE 3 

Kitooenic Activity of Purified GGF- J »nd GGF-II 



The mitogenic activity of a highly purified sample 
containing both GGFs I and II was studied using a 
5 quantitative method, which allows a single microculture to 
be examined for DNA synthesis, cell morphology, cell number 
and expression of cell antigens. This technique has been 
modified from a method previously reported by Muir et ml., 
Analytical Biochemistry 185, 377-382, 1990. The main 

10 modifications are: 1) the use of uncoated microtiter plates, 
2) the cell number per well, 3) the use of 5% Foetal Bovine 
Plasma (FBP) instead of 10% Foetal Calf Serum (FCS) , and 4) 
the time of incubation in presence of mitogens and 
bronodeoxyuridine (BrdU) , added simultaneously to the 

15 cultures. In addition the cell monolayer was not washed 
before fixation to avoid loss of cells, and the incubation 
time of monoclonal mouse anti-BrdU antibody and peroxidase 
conjugated goat anti-mouse immunoglobulin (IgG) antibody 
vere doubled to increase the sensitivity of the assay. The 

20 assay, optimized for rat sciatic nerve Schwann cells, has 
also been used for several cell lines, after appropriate 
modifications to the cell culture conditions. 



T. Methods of Mitoge ns* s Testing 

On day 1, purified Schwann cells were plated onto 

25 uncoated 96 well plates in 5% FBP/Dulbecco's Modified Eagle 
Medium (DHEM) (5,000 cells/well) . On day 2, GGFs or other 
test factors were added to the cultures, as well as BrdU at 
a final concentration of 10 pm. After 48 hours (day 4) BrdU 
incorporation was terminated by aspirating the medium and 

30 cells were fixed with 200 jil/vell of 70% ethanol for 20 min 
at room temperature. Next, the cells were washed with water 
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and the DNA denatured by incubation with 100 til 2N HC1 for 
10 din at 37 *C. Following aspiration, residual acid was 
neutralized by filling the veils with 0.1 K borate buffer, 
pH 9.0, and the cells vere washed vith phosphate buffered 

5 saline (PBS) • Cells vere then treated vith 50 **1 of 

blocking buffer (PBS containing 0.1% Triton X 100 and 2% 
normal goat serum) for 15 min at 37* C. After aspiration, 
monoclonal mouse anti-BrdU antibody (Dako Corp., Santa 
Barbara, CA) (50 ^1/vell, 1.4 Mg/»1 diluted in blocking 

10 buffer) vas added and incubated for two hours at 37 # C. 
Unbound antibodies vere removed by three vashes in PBS 
containing 0.1% Triton X-100 and peroxidase-conjugated goat 
ant-mouse IgG antibody (Dako Corp., Santa Barbara, CA) (50 
fil/vell, 2 Mg/al diluted in blocking buffer) vas added and 

15 incubated for one hour at 37 # C. After three vashes in 
PBS/Triton and a final rinse in PBS, veils received 100 
Ml/vell of 50 mK phosphate/ citrate buffer, pH 5.0, 
containing 0.05% of the soluble chromogen o-phenylenediamine 
(OPD) and 0.02% H 2 0 2 . The reaction vas terminated after 

20 5-20 min at room temperature, by pipetting 80 pi from each 
veil to a clean plate containing 40 Ml/veil of 2N sulfuric 
acid. The absorbance vas recorded at 490nm using a plate 
reader (Dynatech Labs) . The assay plates containing the 
cell monolayers vere vashed tvice vith PBS and 

25 immunocytochemically stained for BrdU-DNA by adding 100 
111 /veil of the substrate diaminobenzidine (DAB) and 0.02% 
H 2 0 2 to generate an insoluble product. After 10-20 min the 
staining reaction vas stopped by vashing vith vater, and 
BrdU-positive nuclei observed and counted using an inverted 

30 microscope, occasionally, negative nuclei vere 

counterstained vith 0.001% Toluidine blue and counted as 
before. 
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Tl. Cell lines used for M itooenesis Assays 

Sviss 3T3 Fibroblasts: Cells, from Flow Labs, were 
maintained in DKEK supplemented with 10% FCS, penicillin and 
streptomycin, at 37 # C in a humidified atmosphere of 10% C0 2 
in air. Cells were fed or subcultured every two days. For 
mitogenic assay, cells were plated at a density of 5,000 
cells/well in complete medium and incubated for a week until 
cells were confluent and quiescent. The serum containing 
medium was removed and the cell monolayer washed twice with 
serum free-medium. 100 nl of serum free medium containing 
mitogens and 10mM of BrdU were added to each well and 
incubated for 48 hours. Dose responses to GGFs and serum or 
FDGF (as a positive control) were performed. 

SHK (Baby Eamster Kidney} 21 C13 Fibroblasts-. Cells 
from European Collection of Animal Cell Cultures (ECACC) , 
vere maintained in Glasgow Modified Eagle Medium (GMEM) 
supplemented with 5% tryptose phosphate broth, 5% FCS, 
penicillin and streptomycin, at 37*C in a humidified 
atmosphere of 5% C0 2 in air. Cells were fed or subcultured 
every two to three days. For mitogenic assay, cells were 
plated at a density of 2,000 cell/well in complete medium 
for 24 hours. The serum containing medium was then removed 
and after washing with serum free medium, replaced with 100 
jil of 0.1% FCS containing GMEM or GMEM alone. GGFs and FCS 
or bFGF as positive controls were added, coincident with 
10/*M BrdU, and incubated for 48 hours. Cell cultures were 
then processed as described for Schwann cells. 

C6 Fat Glioma Cell Line: Cells, obtained at passage 
39, were maintained in DMEM containing 5% FCS, 5% Horse 
serum (HS) , penicillin and streptomycin, at 37*C in a 
humidified atmosphere of 10% C0 2 in air. Cells were fed or 
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subcultures every three days. For mitogenic assay, cells 
vere plated at a density of 2,000 cells/veil in complete 
medium and incubated for 24 hours. Then medium vas replaced 
vith a mixture of 1:1 DMEK and F12 medium containing 0.1% 
5 FCS, after washing in serum free medium. Dose responses to 
GGFs, FCS and cFGF were then performed and cells vere 
processed through the ELISA as previously described for the 
other cell types. 

PC12 (Rat Adrenal Pheochromocytoma Cells) : Cells 
10 from ECACC, vere maintained in RPKI 1640 supplemented vith 
10% HS, 5% FCS i penicillin and streptomycin, in collagen 
coated flasks, at 37 «C in a humidified atmosphere of 5% C0 2 
in air. Cells vere fed every three days by replacing 80% of 
the medium. For mitogenic assay, cells vere plated at a 
15 density of 3,000 cells/veil in complete medium, on collagen 
coated plates (50 ^1/vell collagen, Vitrogen Collagen Corp. , 
diluted 1 : 50, 30 min at 37 *C) and incubated for 24 hours. 
The medium vas then placed vith fresh RPMI either alone or 
containing 1 mM insulin or 1% FCS. Dose responses to FCS/HS 
20 (1:2) as positive control and to GGFs vere performed as 
before. After 48 hours cells vere fixed and the ELISA 
performed as previously described. 

III. Results of Mitoaenesis Assays : All the experiments 

presented in this Example vere performed using a highly 
25 purified sample from a Sepharose 12 chromatography 

purification step (see Example 1, section D) containing a 

mixture of GGF-I and GGF-II (GGFs). 

First, the results obtained vith the BrdU 

incorporation assay vere compared vith the classical 
30 mitogenic assay for Schvann cells based on [125]I-UdR 



incorporation into DNA of dividing cells, described by 
J. P. Brookes (Methods Enzymol. 147:217, 1987). 

Figure 13 shows the comparison of data obtained with 
-the two assays, performed in the same cell culture 
5 conditions (5,000 cells/well, in 5% FBP/DMEM, incubated in 
presence of GGFs for 48hrs) . As clearly shown, the results 
are comparable, but BrdU incorporation assay appears to be 
slightly more sensitive, as suggested by the shift of the 
curve to the left of the graph, i.«. to lower concentrations 
10 of GGFS. 

As described under the section "Methods of 
Kitogenesis Testing", after the immunoreactive BrdU-DNA has 
been quantitated by reading the intensity of the soluble 
product of the OPD peroxidase reaction, the original assay 
15 plates containing cell monolayers can undergo the second 
reaction resulting in the insoluble DAB product, which 
stains the BrdU positive nuclei. The microcultures can then 
be examined under an inverted microscope, and cell 
morphology and the numbers of BrdU-positive and negative 
20 nuclei can be observed. 

In Figure 14a and Figure 14b the BrdU-DNA 
iiusunoreactivity, evaluated by reading absorbance at 490 nm, 
is compared to the number of BrdU-positive nuclei and to the 
percentage of BrdU-positive nuclei on the total number of 
25 cells per well, counted in the same cultures. Standard 

deviations were less than 10%. The two evaluation methods 
show a very good correlation and the discrepancy between the 
values at the highest dose of GGFs can be explained by the 
different extent of DNA synthesis in cells detected as 

30 BrdU-positive. 

The BrdU incorporation assay can therefore provide 
additional useful information about the biological activity 
of polypeptides on Schwann cells when compared to the (125) 
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I-UdR incorporation assay. For example, the data reported 
in Figure 15 show that GGFs can act on Schwann cells to 
induce DNA synthesis, but at lower doses to increase the 
number of negative cells present in the microculture after 
5 48 hours. 

The assay has then been used on several cell lines 
of different origin. In Figure 16 the mitogenic responses 
of Schwann cells and Swiss 3T3 fibroblasts to GGFs are 
compared; despite the veaX response obtained in 3T3 
10 fibroblasts, some clearly BrdU-positive nuclei were detected 
in these cultures. Control cultures were run in parallel in 
presence of several doses of FCS or human recombinant FDGF, 
showing that the cells could respond to appropriate stimuli 
(not shown) . 

15 The ability of fibroblasts to respond to GGFs was 

further investigated using the BHK 21 C13 cell line. These 
fibroblasts, derived from kidney, do not exhibit contact 
inhibition or reach a quiescent state when confluent. 
Therefore the experimental conditions were designed to have 

20 a very low background proliferation without compromising the 
cell viability. GGFs have a significant mitogenic activity 
on BKK21 C13 cells as shown by Figure 17 and Figure 18. 
Figure 17 shows the Brdu incorporation into DNA by BHK 21 
C13 cells stimulated by GGFS in the presence of 0.1% FCS. 

25 The good mitogenic response to FCS indicates that cell 
culture conditions were not limiting. In Figure 18 the 
mitogenic effect of GGFs is expressed as the number of 
BrdU-positive and BrdU-negative cells and as the total 
number of cells counted per veil. Data are representative 

30 of two experiments run in duplicates; at least three fields 
per well were counted. As observed for Schwann cells in 
addition to a proliferative effect at low doses, GGFs also 
increase the numbers of nonresponding cells surviving. The 
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percentage of BrdU positive cells is proportional to the 
increasing amounts of GGFs added to the cultures. The total 
number of cells after 48 hours in presence of higher doses 
of GGFs is at least doubled, confirming that GGFs induce DNA 
5 synthesis and proliferation in BHK21 C13 cells* Under the 
same conditions, cells maintained for 48 hours in the 
presence of 2% FCS shoved an increase of about six fold (not 
shown) ♦ 

C6 glioma cells have provided a useful model to 
10 study glial cell properties. The phenotype expressed seems 
to be dependent on the cell passage , the cells more closely 
resembling an astrocyte phenotype at an early stage, and an 
oligodendrocyte phenotype at later stages (beyond passage 
70). C6 cells used in these experiments vere from passage 
15 39 to passage 52. C6 cells are a highly proliferating 
population, therefore the experimental conditions vere 
optimized to have a very low background of BrdU 
incorporation. The presence of 0.1* serum was necessary to 
maintain cell viability without significantly affecting the 
20 mitogenic responses, as shown by the dose response to FCS 
{Figure 19) . 

In Figure 20 the mitogenic responses to aFGF (acidic 
Fibroblast growth factor) and GGFs are expressed as the 
percentages of maximal BrdU incorporation obtained in the 

25 presence of FCS (8%). Values are averages of two 

experiments, run in duplicates. The effect of GGFs was 
comparable to that of a pure preparation of aFGF* aFGF has 
been described as a specific growth factor for C6 cells (Lim 
R. et al., Cell Regulation 1:741-746, 1990) and for that 

30 reason it was used as a positive control. The direct 
counting of BrdU positive and negative cells was not 
possible because of the high cell density in the 
microcultures. In contrast to the cell lines so far 
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reported, PC12 cells did not show any evident responsiveness 
to GGFS, vhen treated under culture conditions in which PC12 
could respond to sera (mixture of PCS and HS as used 
routinely for cell maintenance) . Nevertheless the number of 
cells plated per well seems to affect the behavior of PC12 
cells, and therefore further experiments are required. 

EXAMPLE 4 

jgolating and Cloning of Ku ^gotide Sciences encoding 
proteins containing G6F-T p nd GGF-IT Peptides 

Isolation and cloning of the GGF-II nucleotide 
sequences was performed as outlined herein, using peptide 
sequence information and library screening, and was 
performed as set out below. It will be appreciated that the 
peptides of Figures 4 and 5 can be used as the starting 
point for isolation and cloning of GGF-I sequences by 
following the techniques described herein. Indeed, Figure 
21, SEQ ID Nos. 54-88) shows possible degenerate 
oligonucleotide probes for this purpose, and Figure 23, SEQ 
ID Nos. 90-119, lists possible PCR primers. DNA sequence 
and polypeptide sequence should be obtainable by this means 
as with GGF-II, and also DNA constructs and expression 
vectors incorporating such DNA sequence, host cells 
genetically altered by incorporating such 
constructs/vectors, and protein obtainable by cultivating 
such host cells. The invention envisages such subject 
matter. 

T. Des-Son and Synthesis o f oligonucleotide Probes and 

primers 

Degenerate DNA oligomer probes were designed by 
backtranslating the amino acid sequences (derived from the 
peptides generated from purified GGF protein) into 
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nucleotide sequences. Oligomers represented either the 
coding strand or the non-coding strand of the DNA sequence. 
Khen serine, arginine or leucine were included in the 
oligomer design, then two separate syntheses were prepared 
to avoid ambiguities. For example, serine was encoded by 
either TCN or AGY as in 537 and 538 or 609 and 610. Similar 
codon splitting was done for arginine or leucine (e.g. 544, 
545). DNA oligomers were synthesized on a Biosearch 8750 
4 -column DNA synthesizer using 6-cyanoethyl chemistry 
operated at 0.2 micromole scale synthesis. Oligomers were 
cleaved off the column (500 angstrom CpG resins) and 
deprotected in concentrated ammonium hydroxide for 6-24 
hours at 55-60'C. Deprotected oligomers were dried under 
vacuum (Speedvac) and purified by electrophoresis in gels of 
15% acrylamide (20 mono : 1 bis), 50 mM Tris-borate-EDTA 
buffer containing 7M urea. Pull length oligomers were 
detected in the gels by UV shadowing, then the bands were 
excised and DNA oligomers eluted into 1.5 mis H20 for 4-16 
hours with shaking. The eluate was dried, redissolved in 
0.1 ml H 2 0 and absorbance measurements were taken at 260nm. 

Concentrations were determined according to the 
following formula: 

(A 260 x units/ml) (60.6/length ■ x nK) 

All oligomers were adjusted to 50 nK concentration 
by addition of H 2 0. 

Degenerate probes designed as above are shown in 

Figure 21, SEQ ID tfos. 54-88. 

PCR primers were prepared by essentially the same 
procedures that were used for probes with the following 
modifications. Linkers of thirteen nucleotides containing 
restriction sites were included at the 5' ends of the 
degenerate oligomers for use in cloning into vectors. DNA 
synthesis was performed at 1 micromole scale using 1,000 



angstrom CpG resins and inosine vas used at positions where 
all four nucleotides were incorporated normally into 
degenerate probes. Purifications of PCR primers included an 
ethanol precipitation following the gel electrophoresis 
5 purification. 

IXj Library Construction and Screening 

A bovine genomic DNA library was purchased from 
Stratagene (Catalogue Number: 945701). The library 
contained 2 x 10 6 15-2 Okb Sau3Al partial bovine DNA 

10 fragments cloned into the vector lambda Dashll. A bovine 
total brain CDNA library was purchased from Clonetech 
(Catalogue Number: BL 10139). Complementary DNA libraries 
were constructed (In Vitrogen; Stratagene) from xaRNA 
prepared from bovine total brain, from bovine pituitary and 

15 from bovine posterior pituitary. In Vitrogen prepared two 
cDNA libraries: one library was in the vector lambda gio, 
the other in vector pcDNAI (a plasmid library) . The 
Stratagene libraries were prepared in the vector lambda 
unizap. Collectively, the cDNA libraries contained 14 

2 0 million primary recombinant phage. 

The bovine genomic library was plated on £. coli K12 
host strain LE392 on 23 x 23 cm plates (Nunc) at 150,000 to 
200,000 phage plagues per plate. Each plate represented 
approximately one bovine genome equivalent. Following an 

25 overnight incubation at 37 *c, the plates were chilled and 
replicate filters were prepared according to procedures of 
Kaniatis et al. (2:60-81). Four plague lifts were prepared 
from each plate onto uncharged nylon membranes (Pall Biodyne 
A or KSI Nitropure) . The DNA was immobilized onto the 

30 membranes by cross-linking under UV light for 5 minutes or, 
by baking at 80 *C under vacuum for two hours. DNA probes 
were labelled using T4 polynucleotide kinase (New England 



Biolabs) with gamma 32P ATP (New England Nuclear; 6500 
Ci/mmol) according to the specifications of the suppliers. 
Briefly, 50 pools of degenerate DNA oligomer were incubated 
in the presence of 600 »Ci gamma 32 P-ATP and 5 units T4 
5 polynucleotide kinase for 30 minutes at 37 *C. Reactions 
vere terminated, gel electrophoresis loading buffer was 
added and then radiolabeled probes vere purified by 
electrophoresis* 32P labelled probes vere excised from gel 
slices and eluted into vater. Alternatively, DNA probes 

10 vere labelled via PCR amplification by incorporation of 
a-32P-dATP or a-32P dCTP according to the protocol of 
Schovalter and Sommer, Anal. Biochem 177:90*94 (1989). 
Probes labelled in PCR reactions vere purified by desalting 
on Sephadex G-150 columns. 

15 Prehybridization and hybridization vere performed in 

GMC buffer (0.52 K NaPi, 7% SDS, 1% BSA, 1.5 ©M EDTA, O.l M 
KaCl 10 mg/ml tRNA) * Washing vas performed in oligovash 
(160 ml 1 K Na 2 HP0 4 , 200 ml 20% SDS, 8.0 ml 0.5 M EDTA, 100 
ml 5K KaCl, 3632 ml H20). Typically, 20 filters (400 sg. 

2 0 centimeters each) representing replicate copies of ten 
bovine genome equivalents vere incubated in 200 ml 
hybridization solution with 100 pmols of degenerate 
oligonucleotide probe (128-512 fold degenerate). 
Hybridization vas allowed to occur overnight at 5*C below 

25 the minimum melting temperature calculated for the 

degenerate probe. The calculation of minimum melting 
temperature assumes 2*C for an AT pair and 4*C for a GC 
pair. 

Filters vere vashed in repeated changes of oligovash 
30 at the hybridization temperatures four to five hours and 

finally, in 3.2K t etrame thy 1 ammonium chloride, 1% SDS twice 
for 30 min at a temperature dependent on the DNA probe 
length. For 20mers, the final vash temperature vas 60*C. 



Filters were mounted, then exposed to X-ray film (Kodak 
XAR5) using intensifying screens (Dupont Cronex Lightening 
Plus) . Usually, a three to five day film exposure at minus 
80 *c vas sufficient to detect duplicate signals in these 
5 library screens. Following analysis of the results, filters 
could be stripped and reprobed. Filters were stripped by 
incubating through two successive cycles of fifteen minutes 
in a microwave oven at full power in a solution of 1% SDS 
containing lOmM EDTA pH8. Filters were taken through at 
10 least three to four cycles of stripping and reprobing with 
various probes. 

III, Recombinant Phage Isolation, Growth and DNA 
Preparation 

These procedures followed standard protocol as 
15 described in Recombinant DNA (Kaniatis et al 2:60-2:81). 

IV, Analysis of Isolated Clones Using DNA Digestion and 
Southern Blots 

Recombinant Phage DNA samples (2 micrograms) were 
digested according to conditions recommended by the 

20 restriction endonuclease supplier (New England Biolabs) . 
Following a four hour incubation at 37 # C, the reactions 
products were precipitated in the presence of 0.1M sodium 
acetate and three volumes of ethanol. Precipitated DNA was 
collected by centrifugation, rinsed in 75% ethanol and 

25 dried. All resuspended samples were loaded onto agarose 

gels (typically 1% in TAE buffer; 0.04M Tris acetate, 0.002M 
EDTA) . Gel runs were at 1 volt per centimeter from 4 to 20 
hours. Markers included lambda Hind III DNA fragments 
and/or CX174HaeIII DNA fragments (New. England Biolabs). The 

30 gels were stained with 0.5 micrograms/ml of ethidium bromide 
and photographed. For southern blotting, DNA was first 

• 55 - 



depurinated in 'the gel by treatment vith 0.125 N HC1, 
denatured in 0.5 K KaOH and transferred in 2 Ox SSC (3M 
sodium chloride, 0.03 M sodium citrate) to uncharged nylon 
membranes. Blotting vas done for 6 hours up to 24 hours, 
then the filters were neutralized in 0.5 Tris HC1 pH 7.5, 
0.15 K sodium chloride, then rinsed briefly in 50 oM 
Tris -borate EDTA. 

For cross-linking, the filters were wrapped first in 
transparent plastic wrap, then the DNA side exposed for five 
minutes to an ultraviolet light. Hybridization and washing 
vas performed as described for library screening (see 
section 2 of this Example). For hybridization analysis to 
determine whether similar genes exist in other species 
slight modifications were made. The DNA filter was 
purchased from Clonetech (Catalogue Number 7753-1) and 
contains 5 micrograms of EcoRI digested DNA from various 
species per lane. The probe vas labelled by PCR 
amplification reactions as described in section 2 above, and 
hybridizations were done in 80% buffer B(2 g 
polyvinylpyrrolidine, 2 g Ficoll-400, 2 g bovine serum 
albumin, 50 ml IK Tris-HCl (pH 7.5) 58 g NaCl, 1 g sodium 
pyrophosphate, 10 g sodium dodecyl sulfate, 950ml H 2 0) 
containing 10* dextran sulfate. The probes were denatured 
by boiling for ten minutes then rapidly cooling in ice 
vater. The probe was added to the hybridization buffer at 
10 6 dpm 32 P per ml and incubated overnight at 60 *C. The 
filters were washed at 60 *C first in buffer B followed by 2X 
SSC, 0.1% SDS then in lx SSC, 0.1% SDS. For high 
stringency, experiments, final washes were done in 0.1 x 
SSC, 1% SDS and the temperature raised to 65 *C. 

Southern blot data were used to prepare a 
restriction map of the genomic clone and to indicate which 
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subfragments hybridized to the GGF probes (candidates for 
subcloning) . 

V. Sub clonino cf Segments of DNA Homologous to 
Hybridization Probjg; 

5 DNA digests (e.g. 5 micrograms) were loaded onto 1% 

agarose gels then appropriate fragments excised from the 
gels following staining. The DNA vas purified by adsorption 
onto glass beads followed by elution using the protocol 
described by the supplier (Bio 101). Recovered DNA 

10 fragments (100-200 ng) were ligated into linearized 

dephosphorylated vectors, e.g. pT3T7 (Ambion) , which is a 
derivative of pUC18, using T4 ligase (New England Biolabs) . 
This vector carries the E. coli £ lactamase gene, hence, 
trans forxnants can be selected on plates containing 

15 ampicillin. The vector also supplies B-galactosidase 

complementation to the host cell, therefore non-recombinants 
(blue) can be detected using isopropylthiogalactoside and 
Bluogal (Bethesda Research Labs). A portion of the ligation 
reactions was used to transform £. coli K12 XL1 blue 

20 competent cells (Stratagene Catalogue Number: 200236) and 

then the transf ormants were selected on LB plates containing 
50 micrograms per ml ampicillin. White colonies were 
selected and plasmid mini preps were prepared for DNA 
digestion and for DNA sequence analysis. Selected clones 

25 were retested to determine if their insert DNA hybridized 
with the GGF probes. 

VI. DNA Sequencing 

Double stranded plasmid DNA templates were prepared 
from 5 ml cultures according to standard protocols. 
3 0 Sequencing was by the dideoxy chain termination method using 
Sequenase 2.0 and a dideoxynucleotide sequencing kit (US 
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Biochemical) according to the manufacturers protocol (a 
modification of Sanger et al. PNAS; USA 7_£:5463 (1977)]. 
Alternatively, sequencing was done in a DNA thermal cycler 
(Perkin Elmer, model 4800) using a cycle sequencing kit (New 
England Biolabs; Bethesda Research Laboratories) and was 
performed according to manufacturers instructions using a 
5 '-end labelled primer. Sequence primers were either those 
supplied with the sequencing kits or were synthesized 
according to sequence determined from the clones. 
Sequencing reactions were loaded on and resolved on 0.4mm 
thick sequencing gels of 6% polyacrylamide . Gels were dried 
and exposed to X-Ray film. Typically, 35S was incorporated 
vhen standard sequencing kits were used and a 32P end 
labelled primer was used for cycle sequencing reactions. 
Sequences were read into a DNA sequence editor from the 
bottom of the gel to the top (5' direction to 3') and data 
vere analyzed using programs supplied by Genetics Computer 
Group (GCG, University of Wisconsin). 

VII. RNA Preparation n na PCR >^pT i f i cation 

Open reading frames detected in the genomic DNA and 
which contained sequence encoding GGF peptides were extended 
via PCR amplification of pituitary RNA. RNA was prepared 
from frozen bovine tissue (Pelfreeze) according to the 
guanidine neutral-CsCl procedure (Chirgwin et. al. 
Biochemistry 18:5294(1979).) Polyadenylated RNA was selected 
by oligo-dT cellulose column chromatography (Aviv and Leder 
PNAS (USA) €9:1408 (1972)). 

Specific DNA target sequences were amplified 
beginning with either total RNA or polyadenylated RNA 
samples that had been converted to cDNA using the Perkin 
Elmer PCR/RNA Kit Number: N808-0017. First strand reverse 
transcription reactions used 1 pg template RNA and either 



primers of oligo dT with restriction enzyme recognition site 
lingers attached or specific antisense primers determined 
from cloned sequences with restriction sites attached. To 
produce the second strand, the primers either were plus 
5 strand unique sequences as used in 3' RACE reactions 

(Frohman et. al., PNAS (USA) £5.:8998 (1988)) or were oligo 
dT primers with restriction sites attached if the second 
target site had been added by terminal transferase tailing 
first strand reaction products with dATP (e.g. 5' race 
10 reactions, Frohman et. ml,, ibid). Alternatively, as in 
anchored PCR reactions the second strand primers were 
degenerate, hence, representing particular peptide 
sequences. 

The amplification profiles followed the following 

15 general scheme: 1) five minutes soak file at 95 *C; 2) 

thermal cycle file of 1 minute, 95* C; 1 minute ramped down 
to an annealing temperature of 45'C, 50'C or 55*C; maintain 
the annealing temperature for one minute; ramp up to 72 *C 
over one minute; extend at 72 *C for one minute or for one 

20 minute plus a 10 second auto extension; 3) extension cycle 
at 72 *C, five minutes, and; 4) soak file 4'C for infinite 
tine. Thermal cycle files (#2) usually were run for 30 
cycles. A sixteen pi sample of each 100 til amplification 
reaction was analyzed by electrophoresis in 2% Nusieve 1% 

25 agarose gels run in TAE buffer at 4 volts per centimeter for 
three hours. The gels were stained, then blotted to 
uncharged nylon membranes which were probed with labelled 
DKA probes that were internal to the primers. 

Specific sets of DKA amplification products could be 

30 identified in the blotting experiments and their positions 
used as a guide to purification and reamplif ication. When 
appropriate, the remaining portions of selected samples were 
loaded onto preparative gels, then following electrophoresis 
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four to five slices of 0.5 mm thickness (bracketing the 
expected position of the specific product) were taken from 
the gel. The agarose was crushed, then soaked in 0.5 ml of 
electrophoresis buffer from 2-16 hours at 40 *C. The crushed 
5 agarose was centrifuged for two minutes and the aqueous 
phase was transferred to fresh tubes. 

Reamplif ication was done on five microliters 
(roughly 1% of the product) of the eluted material using the 
same sets of primers and the reaction profiles as in the 

10 original reactions. When the reamplif ication reactions 
were completed, samples were extracted with chloroform and 
transferred to fresh tubes. Concentrated restriction 
enzyme buffers and enzymes were added to the reactions in 
order to cleave at the restriction sites present in the 

15 linkers. The digested PCR products were purified by gel 
electrophoresis, then subcloned into vectors as described 
in the subcloning section above. DNA sequencing was done 
described as above. 



VTTT . DNA Sequence Analysis 

20 DNA sequences were assembled using a fragment 

assembly program and the amino acid sequences deduced by the 
GCG programs GelAssemble, Kap and Translate. The deduced 
protein sequences were used as a query sequence to search 
protein sequence databases using KordSearch. Analysis was 

25 done on a VAX Station 3100 workstation operating under VMS 
5.1. The database search was done on SwissProt release 
number 21 using GCG Version 7.0. 

TX. Results of Cloning and Se quencing of genes encoding 
cr.F-T and GGF-II 
30 As indicated above, to identify the DNA sequence 

encoding bovine GGF-II degenerate oligonucleotide probes 
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vere designed from GGF-II peptide sequences. GGF-II 12 (SEQ 
ID No. 44) , a peptide generated via lysyl endopeptidase 
digestion of a purified GGF-II preparation (see Figures 11 
and 12) shoved strohg amino acid sequence homology with 
5 GGF-I 07 (SEQ ID No. 39), a tryptic peptide generated from 
a purified GGF-I 

preparation. GGF-II 12 was thus used to create ten 
degenerate oligonucleotide probes (see oligos 609, 610 and 
649 to 656 in Figure 21 , SEQ ID Nos. 69, 70, 71 and 79, 

10 respectively) . A duplicate set of filters vere probed with 
two sets (set 1«609, 610; set 2=649-5656) of probes encoding 
two overlapping portions of GGF-II 12. Hybridization 
signals vere observed, but, only one clone hybridized to 
both probe sets. The clone (designated GGF2BG1) vas 

15 purified. 

Southern blot analysis of DNA from the phage clone 
GGF2BG1 confirmed that both sets of probes hybridized vith 
that bovine DNA sequence, and shoved further that both 
probes reacted vith the same set of DNA fragments vithin the 

20 clone. Based on those experiments a 4 kb Eco RI 

sub-fragment of "the original clone vas identified, subcloned 
and partially sequenced. Figure 22 shows the nucleotide 
sequence, SEQ ID No. 89) and the deduced amino acid sequence 
of the initial DNA sequence readings that included the 

25 hybridization sites of probes 609 and 650, and confirmed 

that a portion of this bovine genomic DNA encoded peptide 12 
(KASLADSGEYM) . 

Further sequence analysis demonstrated that GGF-II 
12 resided on a 66 amino acid open reading frame (see belov) 

30 vhich has become the starting point for the isolation of 

overlapping sequences representing a putative bovine GGF-II 
gene and a cDNA. 
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Several PCR procedures were used to obtain 
additional coding seguences for the putative bovine GGF-II 
gene. Total RNA and oligo dT-selected (poly A containing) 
RNA samples were prepared from bovine total pituitary, 
5 anterior pituitary, posterior pituitary, and hypothalaaus . 
Using primers froa the list shown in Figure 23, SEQ ID Nos. 
109-119, one-sided PCR reactions (RACE) were used to amplify 
cDNA ends in both the 3' and 5' directions, and anchored PCR 
reactions were performed with degenerate oligonucleotide 
10 primers representing additional GGF-II peptides. Figure 24 
summarizes the contiguous DNA structures and seguences 
obtained in those experiments. From the 3' RACE reactions, 
three alternatively spliced cDNA seguences were produced, 
which have been cloned and seguenced. A 5' RACE reaction 
15 led to the discovery of an additional exon containing coding 
seguence for at least 52 amino acids. Analysis of that 
deduced amino acid seguence revealed peptides GGF-II-6 and a 
seguence similar to GGF-I-18 (see below) . The anchored PCR 
reactions led to the identification of (cDNA) coding 
20 seguences of peptides GGF-II-1, 2, 3 and 10 contained within 
an additional cDNA segment of 300 bp. The 5' limit of this 
segment (i.e., segment E, see Fig. 31) is defined by the 
oligonucleotide which encodes peptide GGF-II-1 and which was 
used in the PCR reaction (additional 5' seguence data exists 
25 as described for the human clone in Example 6) . Thus this 
clone contains nucleotide seguences encoding six out of the 
existing total of nine novel GGF-II peptide seguences. 

The cloned gene was characterized first by 
constructing a physical map of GGF2BG1 that allowed us to 
30 position the coding seguences as they were found (see 

below. Figure 25). DNA probes from the coding seguences 
described above have been used to identify further DNA 
fragments containing the exons on this phage clone and to 
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identify clones that overlap in both directions. The 
putative bovine GGF-II gene is divided into at least 5 
coding segments. Coding segments are defined as discrete 
lengths of DNA sequence which can be translated into 
5 polypeptide sequences using the universal genetic code. The 
coding segments described in Figure 31 and referred to in 
the present application are: 1} particular exons present 
vithin the GGF gene (e.g. coding segment a) , or 2) derived 
from sets of two or more exons that appear in specific sub- 
10 groups of mRNAs, where each set can be translated into the 
specific polypeptide segments as in the gene products shown. 
The polypeptide segments referred to in the claims are the 
translation products of the analogous DNA coding segments. 
Only coding segments A and B have been defined as exons and 
15 sequenced and mapped thus far. The summary of the 

contiguous coding sequences identified is shown in Figure 
26. The exons are listed (alphabetically) in the order of 
their discovery. It is apparent from the intron/exon 
boundaries that exon B may be included in cDNAs that 
20 connect coding segment E and coding segment A. That is, 
exon B cannot be spliced out without compromising the 
reading frame. Therefore, we suggest that three 
alternative splicing patterns can produce putative bovine 
GGF-II cDNA sequences 1, 2 and 3. The coding sequences of 
25 these, designated GGF2BPP1.CDS, GGF2BPP2 . CDS and 

GGF2BPP3 . CDS , respectively, are given in Figures 28a (SEQ ID 
No. 133), 28b (SEQ ID No. 134), and 28c (SEQ ID No. 135), 
respectively. The deduced amino acid sequence of the three 
cDNAs is also given in Figures 28a, (SEQ ID No. 133), 28b 
30 (SEQ ID No. 134), and 28c (SEQ ID No. 135). 

The three deduced structures encode proteins of 
lengths 206, 281 and 257 amino acids. The first 183 
residues of the deduced protein sequence are identical in 
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all three gene products. At position 184 the clones differ 
significantly. A codon for glycine GGT in GGF2BPP1 also 
serves as a splice donor for GGF2BPP2 and GGF2BPP3, which 
alternatively add on exons C, C/D, C/D' and D or C, C/D and 
D, respectively, and shown in figure 33, SEQ ZD No. 149). 
GGFIIBPPl is a truncated gene product which is generated by 
reading past the coding segment A splice junction into the • 
following intervening sequence (intron). This represents 
coding segment A' in figure 31 (SEQ ID No. 140) . The 
transcript ends adjacent to a canonical AATAAA 
polyadenylation sequence, and we suggest that this truncated 
gene product represents a bona fide mature transcript. The 
other two longer gene products share the same 3' 
untranslated sequence and polyadenylation site. 

All three of these molecules contain six of the nine 
novel GGF-II peptide sequences (see Figure 12) and another 
peptide is highly homologous to GGF-I-18 (see Figure 27). 
This finding gives a high probability that this recombinant 
molecule encodes at least a portion of bovine GGF-II. 
Furthermore, the calculated isoelectric points for the three 
peptides are consistent with the physical properties of 
GGF-I and II. Since the molecular size of GGF-II is roughly 
€0 XD, the longest of the three cDNAs should encode a 
protein with nearly one-half of the predicted number of 
amino acids. 

A probe encompassing the B and A exons was labelled 
via PCR amplification and used to screen a cDNA library made 
from RNA isolated from bovine posterior pituitary. One 
clone (GGF2BPP5) showed the pattern indicated in figure 30 
and contained an additional DNA coding segment (G) between 
coding segments A and C. The entire nucleic acid sequence 
is shown in figure 32 (SEQ ID No. 148) . The predicted 
translation product from the longest open reading frame is 



241 amino acids. A portion of a second cDNA (GGF2BPP4) was 
also isolated from the bovine posterior pituitary library 
using the probe described above. This clone showed the 
pattern indicated in figure 30. This clone is incomplete at 
the 5' end, but is a splicing variant in the sense that it 
lacks coding segments C and D. BPP4 also displays a novel 
3' end with regions H, K and L beyond region C/D. The 
sequence of BPP4 is shown in figure 34 (SEQ ID Ho. 150) . 



EXAMPLE 5 

10 GGF sequences in Various Species 

Database searching has not revealed any meaningful 
similarities between any predicted GGF translation products 
and known protein sequences. This suggests that GGF-II is 
the first member of a new family or superfamily of proteins. 

15 in high stringency cross hybridization studies (DNA blotting 
experiments) with other mammalian DNAs we have shown, 
clearly, that DNA probes from this bovine recombinant 
molecule can readily detect specific sequences in a variety 
of samples tested. A highly homologous sequence is also 

20 detected in human genomic DNA. The autoradiogram is shown 
in figure 29. The signals in the lanes containing rat and 
human DNA represent the rat and human equivalents of the GGF 
gene, the sequences of several cDNA's encoded by this gene 
have been recently reported by Holmes et al. (Science 2!£: 

25 1205 (1992)) and Wen et al. (Cell 1£: 559 (1992)). 



EXAMPLE 6 

isolation r>f » Hu m *" Sequence Encoding Human 6GF2 
Several human clones containing sequences from the bovine 
GGFXX coding segment E were isolated by screening a human 
cDNA library prepared from brain stem (Stratagene catalog 
/935206). This strategy was pursued based on the strong 
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link between most of the GGF2 peptides (unique to GGF2) and 
the predicted peptide sequence from clones containing the 
bovine E segment* This library vas screened as described in 
Example 4, Section II using the oligonucleotide probes 914- 
5 919 listed below. 

914TCGGGCTCCATGAAGAAGATGTA 
9 1 5TCCATG AAG AAGATGTACCTGCT 
<S 1 6 ATGTACCTGCTGTCCTCCTTGA 
-9 17TTGAAGAAGGACTCGCTGCTCA 

10 riSAAAGCCGGGGGCTTGAAGAA 
9 1 9 ATGARGTGTGGGCGGCGAAA 

Clones detected with these probes were further 
analyzed by hybridization. A probe derived from coding 
segment A (see Figure 21} , which was produced by labeling a 

15 polymerase chain reaction (PCR) product from segment A, was 
also used to screen the primary library. Several clones 
that hybridized with both A and E derived probes were 
selected and one particular clone, GGF2HBS5, was selected 
for further analysis. This clone is represented by the 

20 pattern of coding segments (EBACC/D'D as shown in Figure 

31) . The E segment in this clone is the human equivalent of 
the truncated bovine version of E shown in Figure 37* 
GGF2HBS5 is the most likely candidate to encode GGF-II of 
all the "putative" GGF-XI candidates described. The length 

25 of coding sequence segment E is 786 nucleotides plus 264 
bases of untranslated sequence. The predicted size of the 
protein encoded by GGF2HBS5 is approximately 423 amino acids 
(approximately 45 kilodaltons, see Figure 45, SEQ ID NO: 
167), which is similar to the size of the deglycosylated 

30 form of GGF-II (see Example 16). Additionally, seven of the 
GGF-II peptides listed in Figure 27 have equivalent 
sequences which fall within the protein sequence predicted 
from region E. Peptides IX-6 and 11-12 are exceptions, 
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vhich fall in coding segment B and coding segment A, 
respectively. RKA encoding the GGF2HBS5 protein was 
produced in an in vitro transcription system driven by the 
bacteriophage T7 promoter resident in the vector (Bluescript 
5 SK [Stratagene Inc.] see Figure 44) containing the CGF2HBS5 
insert* This RNA was translated in a cell free (rabbit 
reticulocyte) translation system and the size of the protein 
product vas 45 Kd. Additionally, the cell-free product has 
been assayed in a Schwann cell mitogenic assay to confirm 
10 biological activity. Schwann cells treated with conditioned 
medium show both increased proliferation as measured by 
incorporation of 12s I-Uridine and phosphorylation on 

tyrosine of a protein in the 185 kilodalton range. I 
Thus the size of the product encoded by GGF2HBS5 and the 

15 presence of DNA sequences vhich encode human peptides highly 
homologous to the bovine peptides shown in Figure 12 confirm 
that GGF2KBS5 encodes the human equivalent of bovine GGF2. 
The fact that conditioned media prepared from cells 
transformed with this clone elicits Schwann cell mitogenic 

2 0 activity confirms that the GGFIIHBS5 gene produce (unlike 
the BPP5 gene product) is secreted. Additionally the 
GGFIIBPP5 gene product seems to mediate the Schwann cell 
proliferation response via a receptor tyrosine kinase such 
as pl85 erbB2 or a closely related receptor (see Example 14). 

25 EXAMPLE 7 

Expression of Human Recombinant' GGF2 in Mammalian and Insect 
Celts 

The GGF2KBS5 cDNA clone encoding human GGF2 (as 
described in Example 6 and also referred to herein as HBS5) 
30 vas cloned into vector pcDL-SRa296 (Takebe et al. Mol* Cell. 
Biol. 8:466-472 (1988) and COS-7 cells were transfected in 
100 mm dishes by the DEAE-dextran method (Sambrook et al. 
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Molecular Cloning: A Laboratory Manual 2nd ed. CSH 
Laboratory NY (1989). Cell lysates or conditioned media 
from transiently expressing COS cells were harvested at 3 or 
4 days post-transfection. To prepare lysates, cell 
5 monolayers vere washed with PBS, scraped from the dishes, 
lysed by three freeze/thaw cycles in 150 pi of 0.25 M Tris- 
HC1, pH 8. Cell debris was pelleted and the supernatant 
recovered. Conditioned media samples (7 ml.) were 
collected, then concentrated and buffer exchanged with 10 mM 
10 Tris, pH 7.4 using Centiprep-10 and Centricon-10 units as 
described by the manuf acturer (Amicon, Beverly, MA) • Rat 
nerve Schwann cells were assayed for incorporation of DNA 
synthesis precursors, as described (see Example 3). 
Conditioned media or cell lysate samples were tested in the 
15 Schwann cell proliferation assay as described in Example 3. 
The aitogenic activity data are shown in Fig. 46. The cDNA, 
GGF2KBS5, encoding GGF2 directed the secretion of the 
protein product to the medium. A small proportion of total 
activity was detectable inside the cells as determined by 
20 assays using cell lysates. GGF2HFB1 and GGFBPP5 cDNA's 
failed to direct the secretion of the product to the 
extracellular medium. GGF activity from these clones was 
detectable only in cell lysates (Fig. 46). 

Recombinant GGF2 was also expressed in CHO cells. 
25 The GGF2HBS5 cDNA encoding GGF2 was cloned into the EcoRI 
site of vector pcdhfrpolyA (Fig. 54) and transfected into 
the DHFR negative CHO cell line (DG44) by the calcium 
phosphate coprecipitation method (Graham and Van Der Eb, 
Virology 52:456-467 (1973). Clones were selected in 
30 nucleotide and nucleoside free a medium (Gibco) in 96-well 
plates. After 3 weeks, conditioned media samples from 
individual clones were screened for expression of GGF by the 
Schwann cell proliferation assay as described in Example 3. 



Stable clones which secreted significant levels of GGF 
activity into the medium were identified. Schwann cell 
proliferation activity data from different volume aliguots 
of CHO cell conditioned medium were used to produce the dose 
5 response curve shown in Fig. 47 (r«f., Graham and Van Der 
Eb, Virology 52:456, 1973). This material was analyzed on a 
Western blot probed with polyclonal antisera raised against 
a GGF2 specific peptide. A broad band of approximately 69- 
90 Kd (the expected site of GGF2 extracted from pituitary 
10 and higher molecular weight glycoforms) is specifically 
labeled (Fig. 49, lane 12). 

Recombinant GGF2 was also expressed in insect cells 
using Baculovirus expression. Sf9 insect cells were 
infected with baculovirus containing the GGF2HBS5 cDNA clone 
15 at a multiplicity of 3-5 (10* cells/ml) and cultured in 
Sf 900-11 medium (Gibco) . Schwann cell mitogenic activity 
was secreted into the extracellular medium (Fig. 48). 
Different volumes of insect cell conditioned medium were 
tested in the Schwann cell proliferation assay in the 
20 absence of forskolin and the data used to produce the dose 
response curve shown in Fig. 48. 

This material was also analyzed on a Western blot 
(Fig. 47) probed with the GGF II specific antibody described 
above. A band of 45 Kd, the size of deglycosylated GGF-II 
25 (see Example 16) was seen. 

The methods used in this example were as follows: 
Schwann cell aitogenic activity of recombinant human 
and bovine glial growth factors was determined as follows: 
Kitogenic responses of cultured Schwann cells were measured 
30 in the presence of 5 fM forskolin using crude recombinant 
GGF preparations obtained from transient mammalian 
expression experiments. Incorporation of [ 12 5 I] -Uridine was 
determined following an 18-24 hour exposure to materials 
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obtained from transfected or nock transfected COS cells as 
described in the Methods. The mean and standard deviation 
of four sets of data are shown. The mitogenic response to 
partially purified native bovine pituitary GGF 
(carboxymethyl cellulose fraction; Goodearl at al., 
submitted) is shown (GGF) as a standard of one hundred 
percent activity. 

cDNAs (Pig. S3) were cloned into pcDL-SRa296 (Takebe 
et al., Kol. Cell Biol. 8:466-472 (1988) ) r and COS-7 cells 
were transfected in 100 mm dishes by the DEAE-dextran nethod 
(Sambrook et al., In Molecular Cloning. A Laboratory Manual, 
2nd. md. (Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, NY, 1989)). Cell lysates or conditioned media were 
harvested at 3 or 4 days post-transfection. To prepare 
lysates, cell monolayers were washed with PBS, scraped from 
the dishes, and lysed by three freeze/thaw cycles in 150 til 
of 0.25 K Tris-HCl, pH 8. Cell debris was pelleted and the 
supernate recovered. Conditioned media samples (7 mis) were 
collected, then concentrated and buffer exchanged with 10 mM 
Tris, pH 7.4 using Centriprep-10 and Centricon-10 units as 
described by the manufacturer (Amicon, Beverly, MA) . Rat 
sciatic nerve Schwann cells were assayed for incorporation 
of DNA synthesis precursors, as described (Davis and 
Stroobant, J. Cell Biol. 110:1353-1360 (1990); Brockes et 
al., Brain Res. 165:105-118 (1979)). 

Western blots of recombinant CHO cell conditioned 
medium were performed as follovs: A recombinant CHO clone 
was cultured in 7 ml. of MCDB302 protein-free medium for 3 
days. 2 ml of conditioned medium was concentrated, buffer 
exchanged against 10 mM Tris-HCl, pH 7.4 and lyophilized to 
dryness. The pellet was resuspended in SDS-PAGE sample 
buffer, subjected to reducing SDS gel electrophoresis and 
analyzed by Western blotting with a GGF peptide antibody. 1 
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CKO control was done by using conditioned medium from 
untransfected CHO-DG44 host and the CHO HBS5 levels were 
assayed using conditioned medium from a recombinant clone. 

EXAMPLE 8 

Isolation of Other Human Seeruences Related to Bovine GGF 
The result in Examples 5 and 6 indicate that GGF related 
sequences from human sources can also be easily isolated by 
using DNA probes derived from bovine GGF sequences. 
Alternatively the procedure described by Holmes et al. 
(Science 256 : 1205 (1992)) can be used. In this example a 
human protein (heregulin a) , which binds to and activates 
the pl85 erbB2 receptor (and is related to GGF), is purified 
from a tumor cell line and the derived peptide sequence is 
used to produce oligonucleotide probes which were utilized 
to clone the cDNA's encoding heregulin. The biochemical 
assay for pl85 erbB2 receptor activation is distinguished 
from Schwann cell proliferation. This is a similar approach 
to that used in examples 1-4 for the cloning of GGF 
sequences from pituitary cDNAs. The heregulin protein and 
complementary DNAs were isolated from tumor cell lines 
according to the following procedures. 

Heregulin was purified from medium conditioned by MDA-KB-231 
breast cancer cells (ATCC #HTB 26) grown on Percell 
Biolytica microcarrier beads (Hyclone Labs) . The medium (10 
liters) was concentrated -25-fold by filtration through a 
membrane (10-kD cutoff) (Millipore) and clarified by 
centrifugation and filtration through a filter (0.22 fin). 
The filtrate was applied to a heparin Sepharose column 
(Pharmacia) and the proteins were eluted with steps of 0.3, 
0.6, and 0.9 K NaCl in phosphate-buffered saline. Activity 
in the various chromatographic fractions was measured by 
quantifying the increase in tyrosine phosphorylation of 
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plE5 er*>B2 in KCF _ 7 breast tumor cells (ATCC # HTB 22). 
HCF-7 cells were plated in 24-well Costar plates in F12 
(50%) Dulbecco's minimum essential medium (50%) containing 
serum (10%) (10 s cells per veil), and allowed to attach for 
5 at least 24 hours. Prior to assay, cells were transferred 
into medium without serum for a minimum of 1 hour. Column 
fractions (10 to 100 pi) were incubated for 30 min. at 37*. 
Supernatants were then aspirated and the reaction was 
stopped by the addition of SDS-FAGE sample buffer 100 yl) . 

10 Samples were heated for 5 min. at 100*C, and portions (10 to 
15 ul) were applied to a tris-glycine gel (4 to 20%) 
(Novex) . After electrophoresis , proteins were 
electroblotted onto a polyvinylidenedif luoride (PVDF) 
membrane and then blocked with bovine serum albumin (5%) in 

15 tris-buf fered saline containing Tween-20 (0.05%) (TBST) . 
Blots were probed with a monoclonal antibody (1:1000 
dilution) to phosphotyrosine (Upstate Biotechnology) for a 
minimum of 1 hour at room temperature. Blots were washed 
with TBST, probed with an antibody to mouse immunoglobulin G 

20 conjugated to alkaline phosphatase (Promega) (diluted 
1:7500) for a minimum of 30 min. at room temperature. 
Peactive bands were visualized with 
5-bromo-4-chloro-3-indoyl-l-phosphate and nitro-blue 
tetrazolium. Immunoblots were scanned with a Scan Jet Plus 

25 (Hewlett-Packard) densitometer. Signal intensities for 
unstimulated MCF-7 cells were 20 to 30 units. Fully 
stimulated pl85 ftrbB2 yielded signals of 180 to 200 units. 
The 0.6 M NaCl pool, which contained most of the activity, 
was applied to a polyaspartic acid (PolyLC) column 

30 equilibrated in 17 mM sodium phosphate (pH €.8) containing- 
ethanol (30%). A linear gradient from 0.3 M to 0.6 M NaCl 
in the equilibration buffer was used to elute bound 
proteins. A peak of activity (at -0.45 M NaCl) was further 
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fractionated on a C4 reversed-phase column (SynChropak RP-4) 
equilibrated in buffer containing TFA (0.1%) and 
acetonitrile (15%) . Proteins were eluted from this column 
with an acetonitrile gradient from 25 to 40% over 60 min. 
5 Fractions (1 ml) were collected, assayed for activity, and 
analyzed by SDS-PAGE on tris-glycine gels (4-20%, Novex) . 
HPLC-purif ied HRG-o was digested with lysine C in SDS 
(0.1%), 10 mM dithiothreitol, 0.1 M NH 4 HC0 3 (pH 8.0) for 20 
hours at 37 *C and the resultant fragments were resolved on a 
10 Synchrom C4 column (4000A C , 0.2 by 10 cm). The column was 
equilibrated in 0.1% TFA and eluted with a l-propanol 
gradient in 0.1% TFA (W. J. Henzel, J. T. Stults, C. Hsu, D. 
W. Aswad, J*. Biol. Chem. 264. 15905 (1989)). Peaks from the 
chromatographic run were dried under vacuum and sequenced. 
15 One of the peptides (eluting at -24% l-propanol) gave the 
sequence [ A ] AEKEKTF [ C ] VNGGEXFMVKDLXNP (SEQ ID No. 162). 
Residues in brackets were uncertain and an X represents a 
cycle in which it was not possible to identify the amino 
acid. The initial yield was 8.5 pmol and the sequence did 
20 not correspond to any known protein. Residues 1, 9, 15, and 
22 were later identified in the cDNA sequence as cysteine. 
Direct sequencing of the -45-kD band from a gel that had 
been overloaded and blotted onto a PVDF membrane revealed a 
low abundance sequence XEXKE[G] [R3GK[G]K[G]KKKEXGXG[K] (SEQ 
25 ID No. 163) with a very low initial yield (0.2 pmol). This 
corresponded to amino acid residues 2 to 22 of heregulin-a 
(Fig. 31) , suggesting that serine 2 is the NH 2 -terminus of 
proHRG-a. Although the NH 2 terminus was blocked, it was 
observed that occasionally a small amount of a normally 
30 blocked protein may not be post-translationally modified. 
The NH 2 terminal assignment was confirmed by mass 
spectrometry of the protein after digestion with cyanogen 
bromide. The COOH-terminus of the isolated protein has not 



been definitely identified; however, by mixture sequencing 
of proteolytic digests, the mature sequence does not appear 
to extend past residue 241. Abbreviations for amino 
residues are: A, Ala; C, Cys; D, Asp; E, Glu; F, Phe; G, 
5 Gly; H, His; I # lie; K, Lys; L, Leu; M, Ket; N # Asn; P f Pro; 
Q, Gin; R, Arg; S, Ser; T, Thr; V, Val; W, Trp; and Y, Tyr. 
As a source of cDNA clones, an oligo(dT) -primed JLgtlO (T. V. 
Huynn, R. A, Young, R. K. Davis, JtgtlO and igtll DNA Cloning 
Techniques: A Practical Approach, D. Glover, Ed. (IRC 

10 Press, Oxford, (1984)) cDNA library was constructed (U. 
Gubler and B. J, Hoffman, Gene 2£, 263 (1983)) vith mRNA 
purified (J. M. Chirwin, A. E. Przbyla, R* J. KacDonald, W. 
J. Rutter, Biochemistry 1£, 5294 (1979)) from KDA-KB-231 
cells* The following eightfold degenerate antisense 

15 deoxyoligonucleotide encoding the 13-amino acid sequence 

AEKEKTFCVNGGE (SEQ ID No. 164) (13) was designed on the basis 
of human codon frequency optima (R. Lathe, J. Mol. Biol. 
183 f l (1985)) and chemically synthesized: 
5'-CTCGCC (G OR T) CC (A OR G) TTCAC (A OR G) 

20 CAGAAGGTCTTCTCCTTCTCAGC-3' (SEQ ID No. 165), For the 
purpose of probe design a cysteine was assigned to an 
unknown residue in the amino acid sequence . The probe was 
labeled by phosphorylation and hybridized under 
low-stringency conditions to the cDNA library. The proHRG-a 

25 protein was identified in this library. HRB-B1 cDNA was 
identified by probing a second oligo(dT) -primed XgtlO 
library made from MDA-KB-231 cell mRNA with sequences 
derived from both the 5' and 3' ends of proHRG-a. Clone 13 
(Fig. 2A) was a product of screening a primed 

30 (5'-CCTCGCTCCTTCTTCTTGCCCTTC-3 f primer (SEQ ID No. 166); 
proKRG-a antisense nucleotides 33 to 56) KDA-KB-231 AgtlO 
library with 5' HRG-a sequence. A sequence corresponding to 
the 5' end of clone 13 as the probe was used to identify 



proKRG62 and proHRGB3 in a third oligo(dT) -primed XgtlO 
library derived from HDA-KB-231 cell mRNA. Two cDNA clones 
encoding each of the four HRGs were sequenced (F. Sanger, S. 
Kilken, A. R. Coulson, Proc. Hatl. Acad. Sci. U.S.A. 24., 5463 
1977]). Another cDNA designated clone 84 has an amino acid . 
sequence identical to proHRGB2 through amino acid 420. A 
stop codon at position 421 is followed by a different 
3 '-untranslated sequence. 



EXAMPLE 9 

junction of a Further f pi icing Variant 

The methods in Example 6 produced four closely related 
sequences (heregulin a, Bl, 62, B3) which arise as a result 
of splicing variation. Peles et al. (Cell ££, 205 (1992)), 
and Wen et al. (Cell ££, 559 (1992)) have isolated another 
splicing variant (from rat) using a similar purification and 
cloning approach to that described in Examples 1-4 and 6 
involving a protein which binds to pl85 erbB2 . The cDNA 
clone was obtained as follows (via the purification and 
sequencing of a pl85 erbB2 binding protein from a transformed 
> rat fibroblast cell line). 

A pl85 erbB2 binding protein was purified from conditioned 
medium as follows. Pooled conditioned medium from three 
harvests of 500 roller bottles (120 liters total) was 
cleared by filtration through 0.2 ft filters and concentrated 
5 31-fold with a Pelicon ultrafiltration system using 
membranes with a 20kd molecular size cutoff. All the 
purification steps were performed by using a Pharmacia fast 
protein liquid chromatography system. The concentrated 
material was directly loaded on a column of 
0 heparin-Sepharose (150 ml, preequilibrated with 

phosphate-buffered saline (PBS)). The column was washed 
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with PBS containing 0.2 M NaCl until no absorbance at 280 nm 
vavelength could be detected. Bound proteins were then 
eluted vith a continuous gradient (250 ml) of NaCl (from 0*2 
K to 1.0 K), and 5 inl fractions vere collected. Samples 
5 (0.01 ml of the collected fractions vere used for the 
quantitative assay of the kinase stimulatory activity. 
Active fractions from three column runs (total volume * 360 
ml) vere pooled, concentrated to 25 ml by using a YK10 
ultrafiltration membrane (Amicon, Danvers, MA), and ammonium 
10 sulfate vas added to reach a concentration of 1.7 K. After 
clearance by centrifugation (10,000 x g, 15 min.), the 
pooled material vas loaded on a phenyl-Superose column 
(HR10/10, Pharmacia). The column vas developed vith a 45 ml 
gradient of (NH 4 ) 2 S0 4 (from 1.7 H to no salt) in 0.1 M 
15 Ka 2 P0 4 (pH 7.4), and 2 ml fractions vere collected and 
assayed (0.002 ml per sample) for kinase stimulation (as 
described in Example €) . The major peak of activity vas 
pooled and dialyzed against 50 mK sodium phosphate buffer 
(pH 7.3). A Mono-S cation-exchange column (HR5/5, 
20 Pharmacia) vas preequilibrated vith 50 mK sodium phosphate. 
After loading the active material (0.884 mg of protein; 35 
ml) , the column vas vashed vith the starting buffer and then 
developed at a rate of 1 ml/min. vith a gradient of NaCl. 
The kinase stimulatory activity vas recovered at 0.45-0.55 K 
25 salt and vas spread over four fractions of 2 ml each. These 
vere pooled and loaded directly on a Cu +2 chelating columns 
(1.6 ml, HR2/5 chelating Superose, Pharmacia), Most of the 
proteins adsorbed to the resin, but they gradually eluted 
vith a 30 ml linear gradient of ammonium chloride (0-1 K) . 
30 The activity eluted in a single peak of protein at the range 
of 0.05 to 0.2 M NH 4 C1. Samples from various steps of 
purification vere analyzed by gel electrophoresis folloved 
by silver staining using a kit from ICN (Costa Kesa, CA) , 

- 76 - 



and their protein contents were determined with a Coomassie 
blue dye binding assay using a kit from Bio-Rad (Richmond , 
CA) • 

The p44 protein (10 pg) vas reconstituted in 200 til of 0.1 H 
5 ammonium bicarbonate buffer (pH 7.8). Digestion vas 

conducted with L-l-tosyl-amide 2-phenylethyl chloromethyl 
ketone-treated trypsin (Serva) at 37 *C for 18 hr. at an 
enzyme-to-substrate ratio of 1:10. The resulting peptide 
mixture vas separated by reverse-phase KPLC and monitored at 
10 215 nm using a Vydac C4 micro column (2.1 mm i.d. x 15 cm, 

300 A) and an HP 1090 liquid chromatographic system equipped 
vith a diode-array detector and a vorkstation. The column 
vas equilibrated vith 0.1% trif luoroacetic acid (mobile 
phase A) , and elution vas effected vith a linear gradient 
15 from 0%-55% mobile phase B (90% acetonitrile in 0.1% 

trif luoroacetic acid) over 70 min. The flov rate vas 0.2 
ml/min. and the column temperature vas controlled at 25 *C. 
One-third aliquots of the peptide peaks collected manually 
from the HPLC system vere characterized by N-terminal 
20 sequence analysis by Edman degradation. The fraction eluted 
after 27.7 min. (T27.7) contained mixed amino acid sequences 
and vas further rechromatographed after reduction as 
follows: A 70% aliquot of the peptide fraction vas dried in 
vacuo and reconstituted in 100 pi of 0.2 M ammonium 
25 bicarbonate buffer (pH 7.8). DTT (final concentration 2 mM) 
vas added to the solution, vhich vas then incubated at 37 # C 
for 3 0 min. The reduced peptide mixture vas then separated 
by reverse-phase HPLC using a Vydac column (2.1 mm i.d. x 15 
cm) . Elution conditions and flov rat'vere identical to 
30 those described above. Amino acid sequence analysis of the 
peptide vas performed vith a Model 477 protein sequencer 
(Applied Biosystems r Inc. f Foster City, CA) equipped vith an 
on-line phenylthiohydantoin (PTH) amino acid analyzer and a 
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Model 900 data analysis system (Hunkapiller et al. (1986) In 
Methods of Protei n Microcharacterization , J.E. Shively # ed. 
(Clifton, New Jersey: Humana Press p. 223-247) . The protein 
vas loaded onto a trif luoroacetic acid-treated glass fiber 
5 disc precycled with polybrene and NaCl. The PTH-amino acid 
analysis vas performed with a micro liquid chromatography 
system (Model 120) using dual syringe pumps and 
reverse-phase (C-18) narrow bore columns (Applied 
£iosystems, 2.1 am x 250 mm) • 
10 SNA vas isolated from Ratl-EJ cells by standard procedures 
(Maniatis et al.. Molecular Cloning: A Laboratory Manual 
(Cold Spring Harbor, New York (1982) and poly (A)* vas 
selected using an mRNA Separator kit (Clontech Lab, Inc., 
Palo Alto, CA) . cDNA vas synthesized vith the Superscript 
15 kit (from BRL Life Technologies, Inc., Bethesda, MD) . 

Column-fractionated double-strand cDNA vas ligated into an 
Sail- and Notl-digested pJT-2 plasmid vector, a derivative 
of the pCD-X vector (Okayama and Berg, Mol. Cell Biol. 2: 
280 (1983)) and transformed into DH10B £. coli cells by 
2 0 electroporation (Dower et al., Nucl. Acids Res. ^6: 6127 

(1988)). Approximately 5 x 10 s primary transf ormants vere 
screened vith^ tvo oligonucleotide probes that vere derived 
from the protein sequences of the N-terminus of NDF 
(residues 5-24) and the T40.4 tryptic peptide (residues 
25 7-12). Their respective sequences vere as follows (N 
indicates all 4 nt) : 

(1) 5' -ATA GGG AAG GGC GGG GGA AGG GTC KCC CTC NGC 
A T 

AGG GCC GGG CTT GCC TCT GGA GCC TCT-3' 
30 (2) 5'-TTT ACA CAT ATA TTC NCC-3' 
C G G C 
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(1: SEQ ID No. 167; 2: SEQ ID No. 168) 

The synthetic oligonucleotides were end-labeled with 
CY- 32 P]ATP with T4 polynucleotide kinase and used to screen 
replicate sets of nitrocellulose filters. The hybridization 
5 solution contained 6 x SSC, 50 mM sodium phosphate (pH 6.8), 
0.1% sodium pyrophosphate, 2 x Denhardt's solution, 50 itq/ml 
salmon sperm DNA, and 20* formamide (for probe i) or no 
formamide (for probe 2). The filters were washed at either 
50 *C with 0.5 x SSC, 0.2* SDS, 2 mM EDTA (for probe 1) or at 

10 37'C with 2 x SSC, 0.2* SDS, 2 mM EDTA (for probe 2). 
Autoradiography of the filters gave ten clones that 
hybridized with both probes. These clones were purified by 
replating and probe hybridization as described above. 
The cDNA clones were sequenced using an Applied Biosystems 

15 3 73 A automated DNA sequencer and Applied Biosystems Taq 
DyeDeoxy* Terminator cycle sequencing kits following the 
manufacture's instructions. In some instances, sequences 
were obtained using [ 35 S3dATP (Amersham) and Sequenase* kits 
from U.S. Biochemicals following the manufacturer's 

20 instructions. Both strands of the cDNA clone 44 were 

sequenced by using synthetic oligonucleotides as primers. 
The sequence of the most 5' 350 nt was determined in seven 
independent cDNA clones. The resultant clone demonstrated 
the pattern shown in figure 30 (NDF) . 

25 EXAMPLE 10 

Strategies for Dete cting Other Possible Sparing Variants 
Alignment of the deduced amino acid sequences of the cDNA 
clones and PCR products of the bovine, and the published 
human (Fig. 31) and rat sequences show a high level of 

30 similarity, indicating that these sequences are derived from 
homologous genes within the three species. The variable 
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number of messenger RNA transcripts detectable at the 
cDNA/PCR product level is probably due to- extensive . 
tissue-specific splicing. The patterns obtained and shown 
in Figure 30 suggests that other splicing variants exist. A 
list of probable splicing variants is indicated in Figure 
37. Many of these variants can be obtained by coding 
segment specific probing of cDNA libraries derived from 
different tissues and by PCR experiments using primer pairs 
specific to particular coding segments. Alternatively, the 
variants can be assembled from specific cDNA clones, PCR 
products or genomic DNA regions via cutting and splicing 
techniques known to one skilled in the art. For example, a 
rare restriction enzyme cutting site in a common coding 
segment (e.g., A), can be used to connect the FBA amino 
terminus of GGF2BPP5 to carboxy terminal sequences of 
GGF2BPP1, GGFBPP2, GGFBPP3, or GGFBPP4. If the presence or 
the absence of coding segment E and/or G provide benefit for 
contemplated and stated uses, then these coding segments can 
be included in expression constructs. These variant 
sequences can be expressed in recombinant systems and the 
recombinant products can be assayed to determine their level 
of Schwann cell mitogenic activity as well as their ability 
to bind and activate the pl85 erbB2 receptor. 

EXAMPLE 11 

Identification of Functional Elements of GGF 
The deduced structures of the family of GGF 
sequences indicate that the longest forms (as represented by 
GGF2BPP4) encode transmembrane proteins where the 
extracellular part contains a domain which resembles 
epidermal growth factor (see Carpenter and Wahl in Peptide 
Growth Factors and Their Receptors I pp. 69-133, 
Springer-Verlag, NY 1991). The positions of the cysteine 



residues in coding segments C end C/D or C/D' peptide 
sequence are conserved with respect to the analogous 
residues in the epidermal growth factor (EGF) peptide 
sequence (see Figure 35, SEQ ID Nos. 151-153). This 
5 suggests that the extracellular domain functions as receptor 
recognition and biological activation sites. Several of the 
variant forms lack the H, K, and L coding segments and thus 
may be expressed as secreted, diffusible biologically active 
proteins. GGF DNA sequences encoding polypeptides which 
10 encompass the EGF-like domain (EGFL) can have full 

biological activity for stimulating glial cell mitogenic 
activity. 

Kembrane bound versions of this protein may induce 
Schwann cell proliferation if expressed on the surface of 

15 neurons during embryogenesis or during nerve regeneration 
(where the surfaces of neurons are intimately associated 
with the surfaces of proliferating Schwann cells) . 

Secreted (non membrane bound) GGFs may act as 
classically diffusible factors which can interact with 

20 Schwann cells at some distance from their point of 

secretion. Other forms may be released from intracells by 
sources via tissue injury and cell disruption. An example 
of a secreted GGF is the protein encoded by GGF2HBS5 (see 
example 6) ; this is the only GGF known which has been found 

25 to be directed to the exterior of the cell (example 7) . 

Secretion is probably mediated via an N-terminal hydrophobic 
sequence found only in region E, which is the N-terminal 
domain contained within recombinant GGF-II encoded by 
GGF2HBS5. 

30 Other GGF's appear to be non-secreted (see example 

6) . These GGFs may be injury response forms which are 
released as a consequence of tissue damage. 
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Other regions of the predicted protein structure of 
GGF-II (encoded by- GGF2HBS5) and other proteins containing 
regions B and A exhibit similarities to the human basement 
membrane heparin sulfate proteoglycan core protein (Kallunk, 
5 P. and Tryggvason, K. , Cell Biology Vol. 116, p. 559-571 
(1992)). The peptide ADS GEY, which is located next to the 
second cysteine of the C2 immunoglobulin fold in these 
GGF's, occurs in nine of twenty-two C-2 repeats found in 
that basal lamina protein. This evidence strongly suggests 
10 that these proteins may associate with matrix proteins such 
as those associated with neurons and glia, and may suggest a 
method for sequestration of glial growth factors at target 
sites. 

EXAMPLE 12 

15 Purification of GGFs from Recombinant Cells 

In order to obtain full length or portions of GGFs 
to assay for biological activity, the proteins can be 
overproduced using cloned DNA. Several approaches can be 
used. A recombinant £. coli cell containing the sequences 

20 described above can be constructed. Expression systems such 
as pNH8a or pHH16a (Stratagene, Inc.) can be used for this 
purpose by following manufacturers procedures. 
Alternatively, these sequences can be inserted in a 
mammalian expression vector and an overproducing cell line 

25 can be constructed. As an example, for this purpose DNA 
encoding a GGF, clone GGF2BPP5 has been expressed in both 
COS cells and Chinese hamster ovary cells (see Example 7) 
(J. Biol. Chem. 263, 3521-3527, (1981) ) . This vector 
containing GGF DNA sequences can be transfected into host 

30 cells using established procedures. 

Transient expression can be examined or 
G418-resistant clones can be grown in the presence of 
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methotrexate to select for cells that amplify the dhfr gene 
(contained on the pKSXND vector) and, in the process, 
co-amplify the adjacent GGF protein encoding sequence. 
Because CKO cells can be maintained in a totally serum-free, 
5 protein-free medium (Hamilton and Ham, In Vitro 22, 537-547 
(1977)), the desired protein can be purified from the 
medium. Western analysis using the antisera produced in 
Example 9 can be used to detect the presence of the desired 
protein in the conditioned medium of the overproducing 
10 cells. 

The desired protein (rGGF-II) was purified from the 
medium conditioned by transiently expressing COS cells as 
follows. rGGF-II was harvested from the conditioned medium 
and partially purified using Cation Exchange Chromatography 

15 (POROS-HS). The column was equilibrated with 33.3 mM KES at 
pH 6.0, Conditioned media was loaded at flow rate of 10 
ml/min. The peak containing Schwann cell proliferation 
activity and immunoreactive (using the polyclonal antisera 
was against a GGFII peptide described above) was eluted with 

20 50 mM Tris, 1M NaCl pH 8.0. (Figure 50A and SOB 
respectively) . 

rGGF-II is also expressed using a stable Chinese 
Hamster Ovary cell line. rGGF-II from the harvested 
conditioned media was partially purified using Cation 

25 Exchange Chromatograph (POROS-HS) . The column was 

equilibrated with PBS pH 7.4. Conditioned media was loaded 
at 10 ml/min. The peak containing the Schwann Cell 
Proliferative activity and immunoreactivity (using GGFII 
polyclonal antisera) was eluted with 50 mM Hepes, 500 mM 

30 NaCl pH 8.0. An additional peak was observed at 50 mM 
Hepes, IK NaCl pH 8.0 with both proliferation as well as 
immunoreactivity (Fig. 51) . 
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rGGF-II can be further purif ied using Hydrophobic 
Interaction Chromatography as a high resolution step; Cation 
Exchange/Reverse phase Chromatography (if needed as second 
high resolution step) ; a viral inactivation step and a DNA 
5 removal step such as Anion Exchange chromatography. 

Detailed description of procedures used are as 
follows: 

Schwann Cell Proliferation Activity of the 
recombinant GGF-II peak eluted from the Cation Exchange 

10 column was determined as follows: Mitogenic responses of 
the cultured Schwann cells were measured in the presence of 
5 jiM forskolin using the peak eluted by 50 mM Tris 1 M NaCl 
pH 8.0. The peak was added at 20 l f 10 1 (1:10) 10 1 and 
(1:100) 10 l. Incorporation of 125 I-Uridine was determined 

15 and expressed as (CPM) following an 18-24 hour exposure. 

An immunoblot using polyclonal antibody raised 
against a peptide of GGF-II was carried out as follows: 10 
^1 of different fractions were run on 4-12% gradient gels. 
The gels were transferred on to Nitrocellulose paper, and 

20 the nitrocellulose blots were blocked with 5% BSA and probed 
with GGF-II-specific antibody (1:250 dilution), 225 I 
protein A (1:500 dilution, Specific Activity = 9.0/ci/g) was 
used as the secondary antibody. The immunoblots were 
exposed to Kodax X-Ray films for 6 hours. The peak 

25 fractions eluted with 1 K NaCl showed a broad immunoreactive 
band at 65-90 Kd which is the expected size range for GGFII 
and higher molecular weight glycoforms. 

GGF-II purification on cation exchange columns was 
performed as follows: CHO cell conditioned media expressing 
30 rGGFII was loaded on the cation exchange column at 10 

ml/min. The column was equilibrated with PBS pH 7.4. The 
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elution was achieved with 50 mM Hepes 500 mM NaCl pH 8.0 and 
50 mM Hepes 1M NaCl pH 8.0 respectively. All fractions were 
analyzed using the Schwann cell proliferation assay (CPM) 
described herein. The protein concentration (mg/ml) was 
5 determined by the Bradford assay using BSA as the standard. ' 
A Western blot using 10 pi of each fraction was 
performed. As indicated in Figure 51A and 51B, 
immunoreactivity and the Schwann cell activity co-migrates. 
The Schwann cell mitogenic assay described herein 

10 may be used to assay the expressed product of the full 

length clone or any biologically active portions thereof. 
The full length clone GGF2BPP5 has been expressed 
transiently in COS cells. Intracellular extracts of 
transfected COS cells show biological activity when assayed 

15 in the Schwann cell proliferation assay described in Example 
1. In addition, the full length close encoding GGF2HBS5 has 
been expressed stably in CHO and insect viral systems 
(Example 7) cells. In this case both cell extract and 
conditioned media show biological activity in the Schwann 

20 cell proliferation assay described in Example 1. Any member 
of the family of splicing variant complementary DNA's 
derived from the GGF gene (including the Heregulins) can be 
expressed in this manner and assayed in the Schwann cell 
proliferation assay by one skilled in the art. 

25 Alternatively, recombinant material may be isolated 

from other variants according to Wen et al. (Cell §£, 559 
(1992)) who expressed the splicing variant Neu 
differentiation factor (NDF) in COS-7 cells. cDNA clones 
inserted in the pJT-2 eukaryotic plasmid vector are under 

30 the control of the SV40 early promoter, and are 3 '-flanked 
with the SV4 0 termination and polyadenylation signals. 
COS-7 cells were transfected with the pJT-2 plasmid DNA by 
electroporation as follows: 6 x 10 6 cells (in 0.8 ml of 
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DKEK and 10% FEBS) were transferred to a 0.4 cm cuvette and 
mixed with 20 of plasmid DNA in 10 Ml of TE solution (10 
mM Tris-HCl (pH 8.0), 1 mM EDTA). Electroporation was 
performed at room temperature at 1600 V and 25 pY using a 
Bio-Rad Gene Pulser apparatus with the pulse controller unit 
set at 200 ohms. The cells were then diluted into 20 ml of 
DKEK, 10% FBS and transferred into a T75 flask (Falcon) . 
After 14 hr. of incubation at 37 *C, the medium was replaced 
with DKEM, 1% FBS, and the incubation continued for an 
additional 48 hr. Conditioned medium containing recombinant 
protein which was harvested from the cells demonstrated 
biological activity in a cell line expressing the receptor 
for this protein'. This cell line (cultured human breast 
carcinoma cell line AU 565) was treated with recombinant 
material. The treated cells exhibited a morphology change 
which is characteristic of the activation of the erbB2 
receptor. Conditioned medium of this type also can be 
tested in the Schwann cell proliferation assay. 

EXAMPLE 13 

Purification and Assay of Other Proteins which bind 
t>1£5 erbB2 R eeet> tor 

I. Purification of ctp30 and p70 

Lupu et al. (Science £4£, 1552 (1990)) and Lippman 
and Lupu (patent application number PCT/US91/03443 (1990)), 
hereby incorporated by reference, have purified a protein 
from conditioned media of a human breast cancer cell line 
KDA-MB-231, as follows. 

Conditioned media collections were carried using 
well-known procedures. The media was concentrated 100-fold 
in an Amicon ultra-filtration cell (YK5 membrane) (Amicon, 
Danvers, MA) . Once clarified and concentrated, the media 
were stored at -20°C while consecutive collections were made 
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during the following days. The concentrated media were 
dialyzed using Spectra/por® 3 tubing (Spectrum Kedical 
Industries, Los Angeles, CA) against 100 volumes of 0.1 M 
acetic acid over a two day period at 4'C. The material that 
5 precipitated during dialysis was removed by centrifugation 
at 4000 rpm for 30 min. at 4*C; protease inhibitors were 
added. The clarified sample was then lyophilized. 

Lyophilized conditioned medium was dissolved in 1 H 
acetic acid to a final concentration of about 25 mg/ml total 

10 protein. Insoluble material was removed by centrifugation 
at 10,000 rpm for 15 minutes. The sample was then loaded 
onto a Sephadex G-100 column (XK 16, Pharmacia, Piscataway, 
NJ) , was equilibrated and was subjected to elution with 1 M 
acetic acid at 4*C with an upward flow of 30 ml/hr. 100 ng 

15 of protein was processed from 4 ml of 100-fold concentrated 
medium. Fractions containing 3 ml of eluate were 
lyophilized and resuspended in 300 nl PBS for assay and 
served as a source for further purification. 

Sephadex G-100 purified material was run on 

20 reversed-phase high pressure liquid chromatography (HPLC) . 
The first step involved a steep acetonitrile gradient. 
Steep acetonitrile gradient and all other HPLC steps were 
carried out at room temperature after equilibration of the 
C3-Reversed phase column with 0.05% TFA (Trif luoroacetic 

25 acid) in water (HFLC-grade) . The samples were loaded and 
fractions were eluted with a linear gradient (0-45% 
acetonitrile in 0.05% TFA) at a flow rate of 1 ml /min. over 
a 30 minute period. Absorbance was monitored at 280 nm. 
One ml fractions were collected and lyophilized before 

30 analysis for EGF receptor-competing activity. 

A second HPLC step involved a shallow acetonitrile 
gradient. The pool of active fractions from the previous 
HPLC step was rechromatographed over the same column. 



Elution was performed with a 0-18% acetonitrile gradient in 
0,05% TFA over a 5 minute period followed by a linear 18-45% 
acetonitrile gradient in 0.05% TFA over a 30 minute period. 
The flow rate was 1.0 ml/min. and 1 ml fractions were 
5 collected. Human TGFcr-like factor was eluted at a 30-32% 
acetonitrile concentration as a single peak detectable by 

Lupu et al. (Proc. Natl. Acad. Sci. ££, 2287 (1992)) 
purified another protein which binds to the pl85 erbB2 

10 receptor. This particular protein, p75, was purified from 
conditioned medium used for the growth of SKBr-3 (a human 
breast cancer cell line) propagated in improved Eagle's 
medium (IMEH: GIBCO) supplemented with 10% fetal bovine 
serum (GIBCO) . Protein p75 was purified from concentrated 

15 (100X) conditioned medium using a pl85 erbB2 affinity column. 
The 94 Kilodalton extracellular domain of pl85 erbB2 (which 
binds p75) was produced via recombinant expression and was 
coupled to a polyacrylamide hydrazido-Sepharose affinity 
chromatography matrix. Following coupling the matrix was 

20 washed extensively with ice cold 1.0 M HC1 and the beads 
were activated with 0.5 M NaN0 2 . The temperature was 
maintained at 0'C for 20 minutes and this was followed by 
filtration and washing with ice cold 0.1 M HC1. 500 ml of 
concentrated conditioned medium was run through the beads by 

25 gravity. The column was washed and eluted stepwise with 1.0 
K citric acid at pH values from 4.0 to 2.0 (to allow 
dissociation of the erbB2 and p75) . All fractions were 
desalted on Pharmacia PD10 columns. Purification yielded a 
homogeneous polypeptide of 75kDa at 3.0-3.5 elution pH 

30 (confirmed by analysis on SDS/PAGE by silver staining). 
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II. Binding of cro30 to ol85 egb B2 

The purified gp30 protein was tested in an assay to 
determine if it bound to pl85* rbB2 . A competition assay 
with a monoclonal antibody against pl85 erbB2 . The gp30 
5 protein displaced antibody binding to pl85* rbB2 in SK-BR-3 
and HDA-HB-453 cells (human breast carcinoma cell lines 
expressing the pl85 erbB2 receptor). Schwann cell 
proliferation activity of gp30 can also be demonstrated by 
treating Schwann cell cultures with purified gp30 using the 
10 assay procedure described in Examples 1-3. 

ITT. Binding of r>75 to plB5 erbB2 

To assess whether the 75-kDa polypeptide (p75) 
obtained from SKBr-3 conditioned medium was indeed a ligand 
for the erbB2 oncoprotein in SKBr-3 cells, a competition 

15 assay as described above for gp30 was used. It was found 
that the p75 exhibited binding activity, whereas material 
from other chromatography fractions did not show such 
activity (data not shown) . The flow-through material showed 
some binding activity. This might be due to the presence of 

20 shed erbB2 ECD. 

TV. Other piS5 erbB2 lioands 

Peles et al. (Cell £9_, 205 (1992)) have also 
purified a i85 erbB2 stimulating ligand from rat cells, (NDF, 
see Example 8 for method). Holmes et al. (Science 256. 1205 

25 (1992)) have purified Heregulin o from human cells which 
binds and stimulates l85 erbB2 (see example 6) . Tarakovsky 
et al. Oncogene 6:218 (1991) have demonstrated bending of a 
25 kD polypeptide isolated from activated macrophages to the 
Neu receptor, a pl85 erbB2 homology, herein incorporated by 

30 reference. 
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VI. KDF Isolation 

Yarden and Peles (Biochemistry 2£, 3543 (1991)) have 
identified a 35 kilodalton glycoprotein which vill stimulate 
the i85 erbB2 receptor. The protein was identified in 
5 conditioned medium according to the following procedure. 
Rat X-EJ cells were grown to confluence in 17 5 -cm 2 flasks 
(Falcon) . Monolayers were washed with PBS and left In 
serum-free medium for 10-16 h. The medium was discarded and 
replaced by fresh serum-free medium that was collected after 

10 3 days in culture. The conditioned medium was cleared by 
low-speed centrifugation and concentrated 100-fold in an 
Amicon ultrafiltration cell with a YM2 membrane (molecular 
weight cutoff of 2000). Biochemical analyses of the neu 
stimulatory activity in conditioned medium indicate that the 

15 ligand is a 35-kD glycoprotein that it is heat stable but 
sensitive to reduction. The factor is precipitable by 
either high salt concentrations or acidic alcohol. Partial 
purification of the molecule by selective precipitation, 
heparin-agarose chromatography, and gel filtration in dilute 

20 acid resulted in an active ligand, which is capable of 

stimulating the protooncogenic receptor but is ineffective 
on the oncogenic neu protein, which is const itutively 
active. The purified fraction, however, retained the 
ability to stimulate also the related receptor for EGF, 

25 suggesting that these two receptors are functionally coupled 
through a bidirectional mechanism. Alternatively, the 
presumed ligand interacts simultaneously with both 
receptors. The presented biochemical characteristic of the 
factor may be used to enable a completely purified factor 

30 with which to explore these possibilities. 

In other publications, Davis et al. (Biochem. 
Biophys. Res. Commun. 179 . 1536 (1991), Proc. Natl. Acad. 
Sci. SB, 8582 (1991) and Greene et al., PCT patent 
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application PCT/US91/02331 (1990)) describe the purification 
of a protein from conditioned medium of a human T-cell 
(ATL-2) cell line. 

ATL-2 cell line is an IL-2 -independent HTLV-l (+) T 
cell line. Kycoplasm-free ATL-2 cells were maintained in 
RPMI 1640 medium containing 10% FCB as the culture medium 
(10% FCS-RPMI 1640) at 37 *C in a humidified atmosphere with 
5% C0 2 . 

For purification of the proteinaceous substance, 
ATL-2 cells were washed twice in 1 x PBS and cultured at 3 x 
10 s ml in serum-free RPMI 1640 medium/2 mM L-glutamine for 
seventy-two hours followed by pelleting of the cells. The 
culture supernatant so produced is termed "conditioned 
medium" (CM. ) . 

CM. was concentrated 100 fold, from 1 liter to 10 
ml, using a YM-2 Diaflo membrane (Amicon, Boston, MA) with a 
lOOOd cutoff. For use in some assays, concentrated CM. 
containing components greater than 1000 MW were rediluted to 
original volume with RPMI medium. Gel electrophoresis using 
a polyacrylamide gradient gel (Integrated Separation 
Systems, Hyde Park, MD or Phorecast System by Amersham, 
Arlington Heights, IL) followed by silver staining of some 
of this two column purified material from the one liter 
preparation revealed at least four to five bands of which 
the lOkD and 20kD bands were unique to this material. 
Passed CM. containing components less than 1000 NW were 
used without dilution. 

Concentrated conditioned medium was filter 
sterilized with a .45/* uniflo filter (Schleicher and 
Schuell, Keene, NH) and then further purified by application 
to a DEAE-SW anion exchange column (Waters, Inc., Milford, 
MA) which had been preequilibrated with lOmM Tris-Cl, pH 8.1 
Concentrated CM. proteins representing one liter of 
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original ATL-2 conditioned medium per HPLC run were absorbed 
to the column and then eluted with a linear gradient of OmM 
to 4 OmM HaCl at a flow rate of 4 ml/min. Fractions were 
assayed using an in vitro immune complex kinase assay with 
10% of the appropriate DEAE fraction (1 column purified 
material) or 1% of the appropriate C18 fractions (two column 
purified material) . The activity which increased the 
tyrosine kinase activity of p!85c-neu in a dose-dependent 
manner using the In vitro immune complex kinase assay was 
eluted as one dominant peak across 4 to 5 fractions (36-40) 
around 220 to 24 0 mM of NaCl. After HPLC-DEAE purification, 
the proteins in the active fractions were concentrated and 
pooled, concentrated and subjected to C18 (million matrix) 
reverse phase chromatography (Waters, Inc., Kilford, MA) 
(referred to as the C18+1 step or two column purified 
material) . Elution was performed under a linear gradient of 
2-propanol against 0.1% TFA. All the fractions were 
dialyzed against RPMI 1640 medium to remove the 2-propanol 
and assayed using the In vitro immune complex kinase assay, 
described below, and a 1% concentration of the appropriate 
fraction. The activity increasing the tyrosine kinase 
activity of pl85c-neu was eluted in two peaks. One eluted 
in fraction 11-13, while a second, slightly less active peak 
of activity eluted in fractions 20-23. These two peaks 
correspond to around 5 to 7% of isopropanol and 11 to 14% 
isopropanol respectively. C16/1 generated fractions 11-13 
were used in the characterization studies. Active fractions 
obtained from the second chromatographic step were pooled, 
and designated as the proteinaceous substance sample. 

A twenty liter preparation employed the same 
purification strategy. The DEAE active fractions 35-41 were 
pooled and subjected to cl8 chromatography as discussed 
above. C18#l fractions 11-13 and 21-24 both had 
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dose-dependent activity. The pool of fractions 11-13 was 
subjected to an additional C18 chromatographic step 
(referred to as C18#2 or three column purified material) . 
Again, fractions 11-13 and 21-24 had activity. The dose 
response of fraction 23 as determined by in vitrP immune 
complex kinase assay as described in Example 8 may be 
obtained upon addition of 0.005% by volume fraction 23 and 
0.05% by volume fraction 23. This represents the greatest 
purity achieved. 

Molecular weight ranges were determined based on gel 
filtration chromatography and ultrafiltration membrane 
analysis. Near equal amounts of tyrosine kinase activity 
were retained and passed by a 10,000 molecular weight cut 
off filter. Almost all activity was passed by a 30,000 
molecular weight cut off filter. Molecular weight ranges 
for active chromatographic fractions were determined by 
comparing fractions containing dose-dependent neu-activating 
activity to the elution profiles of a set of protein 
molecular weight standards (Sigma Chemical Co., St. Louis, 
MO) generated using the same running conditions. A low 
molecular weight region of activity was' identified between 
7,000 and 14,000 daltons. A second range of activity ranged 
from about 14,000 to about 24,000 daltons. 

After gel electrophoresis using a polyacrylamide 
gradient gel (Integrated Separation Systems, Hyde Park, KD 
or Phorecase System by Amersham, Arlington Heights, IL) , 
silver staining of the three-column purified material 
(cl8*2) was done with a commercially available silver 
staining kit (BioRad, Rockville Centre, NY). Fraction 21, 
22, 23, and 24 from cl8*2 purification of the twenty liter 
preparation were run with markers. Fractions 22 and 23 
showed the most potent dose response in the i85 erbB2 (neu) 
kinase assay (see below). The fact that selected molecular 



veight fractions interact with i85 erbB2 was demonstrated 
with an immune complex kinase assay. 

Huang et al. (1992, J. Biol. Chem. 257:11508-11512), 
hereby incorporated by reference, have isolated an 
5 additional neu/erb B2 ligand growth factor from bovine 
kidney. The 25 kD polypeptide factor was isolated by a 
procedure of column fractionation, followed by sequential 
column chromatography on DEAE/cellulose (DE52), Sulfadex 
(sulfated Sephadex G-50) , heparin-Sepharose 4B, and Superdex 
10 75 (fast protein liquid chromatography) . The factor, NEL- 
GF, stimulates tyros ine-specific autophosphorylation of the 
neu/erb B2 gene product. 

VII. Immune complex assay NDF for lioand binding to 
pi85 erb B2 : This assay reflects the differences in the 
15 autophosphorylation activity of immunoprecipitated pl85 

driven by pre-incubation of PN-HR6 cell lysate with varying 
anounts of ATL-2 conditioned medium (C.H.) or proteinaceous 
substance and is referred to hereinafter as neu-activating 
activity. 

20 Cell lines used in the immune complex kinase assay 

were obtained, prepared and cultured according to the 
methods disclosed in Kokai et al., Cell 287-292 (July 
28, 1989) the disclosures of which are hereby incorporated 
by reference as if fully set forth herein, and U.S. 

25 application serial number 386,820 filed July 27, 1989 in the 
name of Kark I. Green entitled "Methods of Treating 
Cancerous Cells with Anti-Receptor Antibodies", the 
disclosures of which are hereby incorporated by reference as 
if fully set forth herein. 

30 Cell lines were all maintained in DMEM medium 

containing 5% FCS as the culture medium (5% FCS-DMEM) at 
37 *C in a humidified atmosphere with 5% C0 2 . 
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Dense cultures of cells in 150 mm dishes were washed 
twice with cold PBS, scraped into 10 ml of freeze-thaw 
buffer (150 mM NaCl, 1 mM MgCl 2 , 20 mM Hepes f pH 7.2, 10% 
Glycerol, 1 mM EDTA, 1% Aprotinin), and centrifuged (600 x 
6, 10 minutes). Cell pellets were resuspended in 1 ml Lysis 
buffer (50 mM Hepes, pH 7.5, 150 mM NaCl, 3% Brij 35, 1 mM 
EDTA, 1.5 mM MgCl 2 , 1% Aprotinin, 1 mM EGTA, 20 tM Na 3 V0 4 , 
10% Glycerol) and rotated for thirty minutes at 4*C. All 
chemicals were from Sigma Chemical Co., St. Louis, Mo, 
unless otherwise indicated. The insoluble materials were 
removed by centrifugation at 40,000 x g for thirty minutes. 
The clear supernatant which was subsequently used is 
designated as cell lysate. 

The cell lysates were incubated for fifteen minutes 
with 50 txl of 50% (volume/volume) Protein A-sepharose (Sigma 
Chemical Co., St. Louis, Missouri), and centrifugated for 
two minutes to preclear the lysates. 50 til aliguots of 
precleared cell lysate were incubated on ice for fifteen 
minutes with conditioned medium, proteinaceous substance, or 
other factors as specified, in a final volume of 1 ml with 
lysis buffer. The sample was then incubated with 5 fig of 
7.16,4 monoclonal antibody, which recognizes the 
extracellular domain of the pl85neu and pl85c-neu, or other 
appropriate antibodies, for twenty minutes on ice, followed 
by a twenty minute incubation with 50 jul of 50% (vol/vol) 
protein A-Sepharose with rotation at 4*C. Immune complexes 
were collected by centrifugation, washed four times with 500 
til of washing buffer (50 mM Hepes, pH 7.5, 0.1%, Brij 35, 
150 mM NaCl, 2 mM EDTA, 1% Aprontinin, 30 ^m Ka 3 V0 4 ) , then 
twice with reaction buffer (20 mM Hepes (pH 7.4), 3 mM MnCl 2 
and 0.1% Brij 35, 30 Na 3 V0 4 ) . Pellets were resuspended 
in 50 txl of reaction buffer and (Gamma- 32 P]-ATP (Amersham, 
Arlington Heights, IL) was added giving a final 



concentration of 0.2 jra. The samples were incubated at 27 *c 
for twenty minutes or at 4*C for 25 minutes with purer 
samples. The reactions were terminated by addition of 3 x 
SDS sample buffer containing 2 mM ATP and 2 mM EDTA and then 
incubating them at 100 *C for five minutes* The samples were 
then subjected to SDS-PAGE analysis on 10% acrylamide gels. 
Gels vere stained, dried, and exposed to Kodak XAR or XRP 
film vith intensifying screens. 

VIII. Purification of acetylcholine receptor inducing 
activity (ARIA) 

ARIA, a 42 kD protein which stimulates acetylcholine 
receptor synthesis, has been isolated in the laboratory of 
Gerald Fischbach (Falls et al., Cell 72:801-815 (1993)). 
ARIA induces tyrosine phosphorylation of a 185 Kda muscle 
transmembrane protein which resembles p!85 erbB2 , and 
stimulates acetylcholine receptor synthesis in cultured 
embryonic myotubes. Sequence analysis of cDNA clones which 
encode ARIA shows that ARIA is a member of the GGF/erbB2 
ligand group of proteins, and this is potentially useful in 
the glial cell mitogenesis stimulation and other 
applications of, e.g., GGF2 described herein. 

EXAMPLE 14 

Protein tyrosine phosphorylation mediated bv GGF in Schwann 
cells 

Rat Schwann cells, following treatment with 
sufficient levels of Glial Growth Factor to induce 
proliferation, show stimulation of protein tyrosine 
phosphorylation (figure 36). Varying amounts of partially 
purified GGF were applied to a primary culture of rat 
Schwann cells according to the procedure outlined in Example 
3. Schwann cells were grown in DMEH/10% fetal calf serum/5 



MM forskolin/0.5/ig per mL GGF-CM (0.5mL per well) in poly 
D-lysine coated 24 well plates. When confluent, the cells 
were fed with DKEK/10% fetal calf serum at 0.5mL per well 
and left in the incubator overnight to quiesce. The 
5 following day, the cells were fed with 0.2mL of DHEM/10% 

fetal calf serum and left in the incubator for 1 hour. Test 
samples were then added directly to the medium at different 
concentrations and for different lengths of time as 
required. The cells were then lysed in boiling lysis buffer 

10 (sodium phosphate, 5mM, pH 6.8; SDS, 2%, B-mercapteothanol, 
5%; dithiothreitol, 0.1M; glycerol, 10%; Bromophenol Blue, 
0*4%; sodium vanadate, lOmK) , incubated in a boiling water 
bath for 10 minutes and then either analyzed directly or 
frozen at -70 # C. Samples were analyzed by running on 7.5% 

15 SDS-PAGE gels and then electroblotting onto nitrocellulose 
using standard procedures as described by Towbin et al. 
(1979) Proc. Katl. Acad. Sci. USA 76:4350-4354. The blotted 
nitrocellulose was probed with antiphosphotyrosine 
antibodies using standard methods as described in Kamps and 

20 Selton (1988) Oncogene 2:305-315. The probed blots were 
exposed to autoradiography film overnight and developed 
using a standard laboratory processor. Densitometric 
measurements were carried out using an Ultrascan XL enhanced 
laser densitometer (LKB) . Molecular weight assignments were 

25 made relative to prestained high molecular weight standards 
(Sigma) . The dose responses of protein phosphorylation and 
Schwann cell proliferation are very similar (figure 36). 
The molecular weight of the phosphorylated band is very 
close to the molecular weight of pl85 erbB2 . Similar results 

30 were obtained when Schwann cells were treated with 

conditioned media prepared from COS cells translates with 
the GGF2HBS5 clone. These results correlate well with the 



- 97 - 



expected interaction of the GGFs with and activation of 
185 erbB2 # 

This experiment has been repeated with recombinant 
GGF-II. Conditioned medium derived from a CHO cell line 
5 stably transformed with the GGF-II clone (GGF2HBS5) 

stimulates protein tyrosine phosphorylation using the assay 
described above. Mock transfected CHO cells fail to 
stimulate this activity (Fig. 52). 

EXAMPLE 15 

10 Assay for Schwann cell Proliferation by Protein Factor from 
the KPA-KB-231 cell line . 

Schwann cell proliferation is mediated by 
conditioned medium derived from the human breast cancer cell 
line KDA-KB-231. On day 1 of the assay , 10 4 primary rat 

15 Schwann cells were plated in 100 fil of Dulbecco's Modified 
Eagle's medium supplemented with 5% fetal bovine plasma per 
well in a 96 well microtiter plate. On day 2 of the assay, 
10 ixl of conditioned medium (from the human breast cancer 
cell line KDA-MB-231, cultured as described in Example 6) 

20 was added to each well of the microtiter plate. One day 6, 
the number of Schwann cells per plate was determined using 
an acid phosphatase assay (according to the procedure of 
Connolly et al. Anal. Biochem. 152 ; 136 (1986)). The plate 
was washed with 100 Ml of phosphate buffered saline (PBS) 

25 and 100 til of reaction buffer (0.1K sodium acetate, (pH 

5.5)), 0.1% Triton X-100, and 10 mM p-nitrophenyl phosphate) 
was added per well* The plate was incubated at 37* C for two 
hours and the reaction was stopped by the addition of 10 pi 
of IN KaOH. The optical density of each sample was read in 

30 a spectrophotometer at 410 nm. A 38% stimulation of cell 
number over Schwann cells treated with conditioned medium 
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from a control cell line (HS-294T, a non-producer of erbB-2 
ligand) was observed. This result shows that a protein 
secreted by the KDA-MB-231 cell line (which secretes a 
pl85 «rbB2 0 i n fli n g activity) stimulates Schwann cell 
5 proliferation. 

EXAMPLE 16 

N-alvcosvlation of GGF 

The protein sequence predicted from the cDNA 
sequence of GGF-II candidate clones GGF2BPP1,2 and 3 

10 contains a number of consensus N-glycosylation motifs. A 
gap in the GGFII02 peptide sequence coincides with the 
asparagine residue in one of these motifs, indicating that 
carbohydrate is probably bound at this site. 

N-glycosylation of the GGFs was studied by observing 

15 nobility changes on SDS-PAGE after incubation with N- 
glycanase, an enzyme that cleaves the covalent linkages 
between carbohydrate and aspargine residues in proteins. 

N-Glycanase treatment of GGF-II yielded a major band 
of KW 40-42 kDa and a minor band at 45-48 kDa. Activity 

20 elution experiments under non-reducing conditions showed a 
single active deglycosylated species at ca 45-50 kDa. 

Activity elution experiments with GGF-I also 
demonstrate an increase in electrophoretic mobility when 
treated with N-Glycanase, giving an active species of MW 26- 

25 28 kDa. Silver staining confirmed that there is a mobility 
shift, although no N-deglycosylated band could be assigned 
because of background staining in the sample used. 

Deposit 

Nucleic acid encoding GGF-II (cDNA, GGF2HBS5) 
30 protein (Example 6) in a plasmid pBluescript 5k, under the 
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control of the T7 promoter, was deposited in the American 
Type Culture Collection, Rockville, Maryland, on September 
2, 1992, and given ATCC Accession No. 75298. Applicant 
acknowledges its responsibility to replace this plasmid 
5 should it become non-viable before the end of the term of a 
patent issued hereon, and its responsibility to notify the 
ATCC of the issuance of such a patent, at which time the 
deposit will be made available to the public. Prior to that 
time the deposit will be made available to the Commissioner 
10 of Patents under the terms of 37 CFR 51.14 and 35 USC 5112. 
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GENERAL INFORMATION: 

(i) APPLICANTS: Goodearl, Andrew; Stroobant, Paul; 
Minghetti, Luisa; Waterfield, Michael; Marchioni, Mark; 
Chen, Maio Su; Hiles, Ian 

(ii) TITLE OF INVENTION: Glial Mitogenic Factors, Their 
Preparation and Use 

(iii) NUMBER OF SEQUENCES: 184 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Felfe & Lynch 

(B) STKEET: 305 Third Avenue 

(C) CITY: Ne«w- York City 

(D) STATE: New York 

(E) COUNTRY: USA 

(F) ZIP: 10022 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette, 5.25 inch, 360 kb storage 

(B) COMPUTER: IBM 

(C) OPERATING SYSTEM: PC-DOS 

(D) SOFTWARE: Wordperfect 

(Vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 03/036,555 

(B) FILING DATE: 24-MAR-1993 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 07/965,173 

(B) FILING DATE: 23-OCT-1992 

(vii) PRIOR APPLICATION DATA: 
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(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: U.K. 91 07566.3 

(B) FILING DATE: 10-APRIL-1991 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Tsai, Christine H. 

(B) REGISTRATION NUMBER: 34,266 

(C) REFERENCE/ DOCKET NUMBER: LUD 5250.17 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (212) 688-9200 

(B) TELEFAX: (212) 838-3884 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 1: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 



filPhe Lys Gly Asp Ala His Thr Glu 
\> 1 5 



CI (2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 2: 
E « (i) SEQUENCE CHARACTERISTICS: 

* (A) LENGTH: 13 

t, (B) TYPE: amino acid 

(C) STRANDEDNESS: 

If! (D) TOPOLOGY: linear 

I;H (ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 1 is Lysine or Arginine; 
C Xaa in position 12 is unknown. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



Xaa Ala Ser Leu Ala Asp Glu Tyr Glu Tyr Met Xaa Lys 
15 10 



102 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 3: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ix) FEATURE: . . 

(D) OTHER INFORMATION: Xaa in position 1 is Lysine or Arginine; 

Xaa in position 10 is unknown, 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Xaa Thr Glu Thr Ser Ser Ser Gly Leu Xaa Leu Lys 
15 10 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 4: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

3 (D) TOPOLOGY: linear 

3 (ix) FEATURE: . . 

(D) OTHER INFORMATION: Xaa in position 1 is Lysine or Arginine. 
J (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



pXaa Lys Leu Gly Glu Met Trp Ala Glu 



1(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 5: 
: (i) SEQUENCE CHARACTERISTICS: 

; (A) LENGTH: 7 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 1 is Lysine or Arginine. 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Xaa Leu Gly Glu Lys Arg Ala 
1 5 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 6: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 1 is Lysine or Arginine. 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Xaa He Lys Ser Glu His Ala Gly Leu Ser He Gly Asp Thr Ala Lys 
15 10 15 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 7: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
J (ix) FEATURE: 

38 (D) OTHER INFORMATION: Xaa in position 1 is Lysine or Arginine, 

y (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

*?aa Ala Ser Leu Ala Asp Glu Tyr Glu Tyr Met Arg Lys 

i 15 10 



;ff2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 8: 
* (i) SEQUENCE CHARACTERISTICS: 

£ (A) LENGTH: 16 

t (B) TYPE: amino acid 

" (C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 1 is Lysine or Arginine. 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Xaa He Lys Gly Glu His Pro Gly Leu Ser He Gly Asp Val Ala Lys 
15 10 15 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 9: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 1 is Lysine or Arginine 

and Xaa in position 12 is unknown. 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Xaa Met Ser Glu Tyr Ala Phe Phe Val Gin Thr Xaa Arg 
15 10 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 10: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
3 ( ix) FEATURE : 

ffl (D) OTHER INFORMATION: Xaa in position 1 is Lysine or Arginine. 

'4 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

=fiaa Ser Glu His Pro Gly Leu Ser lie Gly Asp Thr Ala Lys 
U 15 10 



?0) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 11: 
JJ (i) SEQUENCE CHARACTERISTICS: 

!f (A) LENGTH: 10 

y (B) TYPE: amino acid 

^ (C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 1 is Lysine or Arginine; 

Xaa in position 8 is unknown, 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Xaa Ala Gly Tyr Phe Ala Glu Xaa Ala Arg 
15 10 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 12: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 1 is Lysine or Arginine; 

Xaa in position 7 is unknown. 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Xaa Lys Leu Glu Phe Leu Xaa Ala Lys 
1 5 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 13: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 1 is Lysine or Arginine. 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 



Xaa Thr Thr Glu Met Ala Ser Glu Gin Gly Ala 
15 10 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 14: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 

(B) TYPE: amino acid 

( C ) STRANDEDNES S : 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 1 is Lysine or Arginine. 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Xaa Ala Lys Glu Ala Leu Ala Ala Leu Lys 
15 10 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 15: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ix) FEATURE: . . « . . 

(D) OTHER INFORMATION: Xaa in position 1 is Lysine or Argmine, 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Xaa Phe Val Leu Gin Ala Lys Lys 
1 5 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 16: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

3 (D) TOPOLOGY: linear 

0 (ix) FEATURE : . . . . . 

J (D) OTHER INFORMATION: Xaa in position 1 is Lysine or Argmine. 

J (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



=*Xaa Leu Gly Glu Met Trp 

0 1 5 



1(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 17: 
f (i) SEQUENCE CHARACTERISTICS: 

i (A) LENGTH: 16 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Glu Tyr Lys Cys Leu Lys Phe Lys Trp Phe Lys Lys Ala Thr Val Met 
15 10 15 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 18: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 8 is unknown, 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Glu Ala Lys Tyr Phe Ser Lys Xaa Asp Ala 
15 10 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 19: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ix) FEATURE: ... 

(D) OTHER INFORMATION: Xaa in position 2 is unknown, 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Glu Xaa Lys Phe Tyr Val Pro 
1 5 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 20: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Glu Leu Ser Phe Ala Ser Val Arg Leu Pro Gly Cys Pro Pro Gly Val 
1 5 10 15 

Asp Pro Met Val Ser Phe Pro Val Ala Leu 
20 25 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 21: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2003 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear • 
(ix) FEATURE: 

(D) OTHER INFORMATION: N in positions 31 and 32 could be either 

A or G. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 



GGAATTCCTT TTTTTTTTTT TTTTTTTCTT NNTTTTTTTT TGCCCTTATA CCTCTTCGCC 60 
TTTCTGTGGT TCCATCCACT TCTTCCCCCT CCTCCTCCCA TAAACAACTC TCCTACCCCT 120 
GCACCCCCAA TAAATAAATA AAAGGAGGAG GGCAAGGGGG GAGGAGGAGG AGTGGTGCTG 180 
CGAGGGGAAG GAAAAGGGAG GCAGCGCGAG AAGAGCCGGG CAGAGTCCGA ACCGACAGCC 240 
AGAAGCCCGC ACGCACCTCG CACC ATG AGA TGG CGA CGC GCC CCG CGC CGC 291 

Met Arg Trp Arg Arg Ala Pro Arg Arg 
1 5 

TCC GGG CGT CCC GGC CCC CGG GCC CAG CGC CCC GGC TCC GCC GCC CGC 339 
Ser Gly Arg Pro Gly Pro Arg Ala Gin Arg Pro Gly Ser Ala Ala Arg 
10 15 20 25 

BTCG TCG CCG CCG CTG CCG CTG CTG CCA CTA CTG CTG CTG CTG GGG ACC 387 

'JSer Ser Pro Pro Leu Pro Leu Leu Pro Leu Leu Leu Leu Leu Gly Thr 

jj 30 35 40 

JGCG GCC CTG GCG CCG GGG GCG GCG GCC GGC AAC GAG GCG GCT CCC GCG 435 

**Ala Ala Leu Ala Pro Gly Ala Ala Ala Gly Asn Glu Ala Ala Pro Ala 

n 45 50 55 

^GGG GCC TCG GTG TGC TAC TCG TCC CCG CCC AGC GTG GGA TCG GTG CAG 483 

iGly Ala Ser Val Cys Tyr Ser Ser Pro Pro Ser Val Gly Ser Val Gin 

i : 60 65 70 

nGAG CTA GCT CAG CGC GCC GCG GTG GTG ATC GAG GGA AAG GTG CAC CCG 531 

SGlu Leu Ala Gin Arg Ala Ala Val Val He Glu Gly Lys Val His Pro 
75 80 85 

CAG CGG CGG CAG CAG GGG GCA CTC GAC AGG AAG GCG GCG GCG GCG GCG 579 
Gin Arg Arg Gin Gin Gly Ala Leu Asp Arg Lys Ala Ala Ala Ala Ala 
90 95 100 105 

GGC GAG GCA GGG GCG TGG GGC GGC GAT CGC GAG CCG CCA GCC GCG GGC 627 
Gly Glu Ala Gly Ala Trp Gly Gly Asp Arg Glu Pro Pro Ala Ala Gly 

110 115 120 

CCA CGG GCG CTG GGG CCG CCC GCC GAG GAG CCG CTG CTC GCC GCC AAC 675 
Pro Arg Ala Leu Gly Pro Pro Ala Glu Glu Pro Leu Leu Ala Ala Asn 
125 130 135 
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GGG ACC GTG CCC TCT TGG CCC ACC GCC CCG GTG CCC AGC GCC GGC GAG 723 
Gly Thr Val Pro Ser Trp Pro Thr Ala Pro Val Pro Ser Ala Gly Glu 
140 145 150 

CCC GGG GAG GAG GCG CCC TAT CTG GTG AAG GTG CAC CAG GTG TGG GCG 771 
Pro Gly Glu Glu Ala Pro Tyr Leu Val Lys Val His Gin Val Trp Ala 
155 160 165 

GTG AAA GCC GGG GGC TTG AAG AAG GAC TCG CTG CTC ACC GTG CGC CTG 819 
Val Lys Ala Gly Gly Leu Lys Lys Asp Ser Leu Leu Thr Val Arg Leu 
170 175 180 185 

GGG ACC TGG GGC CAC CCC GCC TTC CCC TCC TGC GGG AGG CTC AAG GAG 867 
Gly Thr Trp Gly His Pro Ala Phe Pro Ser Cys Gly Arg Leu Lys Glu 

190 195 200 

GAC AGC AGG TAC ATC TTC TTC ATG GAG CCC GAC GCC AAC AGC ACC AGC 915 
Asp Ser Arg Tyr lie Phe Phe Met Glu Pro Asp Ala Asn ser Thr Ser 
205 210 215 

CGC GCG CCG GCC GCC TTC CGA GCC TCT TTC CCC CCT CTG GAG ACG GGC 963 
Arg Ala Pro Ala Ala Phe Arg Ala Ser Phe Pro Pro Leu Glu Thr Gly 
220 225 230 

ECGG AAC CTC AAG AAG GAG GTC AGC CGG GTG CTG TGC AAG CGG TGC GCC 1011 
"MArg Asn Leu Lys Lys Glu Val Ser Arg Val Leu Cys Lys Arg Cys Ala 
W 235 240 245 

ilTTG CCT CCC CAA TTG AAA GAG ATG AAA AGC CAG GAA TCG GCT GCA GGT 1059 
iLeu Pro Pro Gin Leu Lys Glu Met Lys Ser Gin Glu Ser Ala Ala Gly 
3250 255 260 265 

L.TCC AAA CTA GTC CTT CGG TGT GAA ACC AGT TCT GAA TAC TCC TCT CTC 1107 
L"=Ser Lys Leu Val Leu Arg Cys Glu Thr Ser Ser Glu Tyr Ser Ser Leu 
~ 270 175 180 

l^AGA TTC AAG TGG TTC AAG AAT GGG AAT GAA TTG AAT CGA AAA AAC AAA 1155 
-Arg Phe Lys Trp Phe Lys Asn Gly Asn Glu Leu Asn Arg Lys Asn Lys 
m 185 190 195 

CCA CAA AAT ATC AAG ATA CAA AAA AAG CCA GGG AAG TCA GAA CTT CGC 1203 
Pro Gin Asn He Lys He Gin Lys Lys Pro Gly Lys Ser Glu Leu Arg 
200 205 210 

ATT AAC AAA GCA TCA CTG GCT GAT TCT GGA GAG TAT ATG TGC AAA GTG 1251 
He Asn Lys Ala Ser Leu Ala Asp Ser Gly Glu Tyr Met Cys Lys Val 
215 220 225 

ATC AGC AAA TTA GGA AAT GAC AGT GCC TCT GCC AAT ATC ACC ATC GTG 1299 
He Ser Lys Leu Gly Asn Asp Ser Ala Ser Ala Asn He Thr He Val 
230 235 240 245 
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GAA TCA AAC GCT ACA TCT ACA TCC ACC ACT GGG ACA AGC CAT CTT GTA 1347 
Glu Ser Asn Ala Thr Ser Thr Ser Thr Thr Gly Thr Ser His Leu Val 

250 255 260 

AAA TGT GCG GAG AAG GAG AAA ACT TTC TGT GTG AAT GGA GGG GAG TGC 1395 
Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu Cys 
265 270 275 

TTC ATG GTG AAA GAC CTT TCA AAC CCC TCG AGA TAC TTG TGC AAG TGC 1443 
Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr Leu Cys Lys Cys 
280 285 290 

CCA AAT GAG TTT ACT GGT GAT CGC TGC CAA AAC TAC GTA ATG GCC AGC 1491 
Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin Asn Tyr Val Met Ala Ser 
295 300 305 

TTC TAC AGT ACG TCC ACT CCC TTT CTG TCT CTG CCT GAA 1530 
Phe Tyr Ser Thr Ser Thr Pro Phe Leu Ser Leu Pro Glu 
400 405 410 

TAGGAGCATG CTCAGTTGGT GCTGCTTTCT TGTTGCTGCA TCTCCCCTCA GATTCCACCT 1590 

AGAGCTAGAT GTGTCTTACC AGATCTAATA TTGACTGCCT CTGCCTGTCG CATGAGAACA 1650 

TTAACAAAAG CAATTGTATT ACTTCCTCTG TTCGCGACTA GTTGGCTCTG AGATACTAAT 1710 

OAGGTGTGTGA GGCTCCGGAT GTTTCTGGAA TTGATATTGA ATGATGTGAT ACAAATTGAT 1770 

OOAGTCAATATC AAGCAGTGAA ATATGATAAT AAAGGCATTT CAAAGTCTCA CTTTTATTGA 1830 

VTAAAATAAAA ATCATTCTAC TGAACAGTCC ATCTTCTTTA TACAATGACC ACATCCTGAA 1890 

LuAAGGGTGTTG CTAAGCTGTA ACCGATATGC ACTTGAAATG ATGGTAAGTT AATTTTGATT 1950 

ffiCAGAATGTGT TATTTGTCAC AAATAAACAT AATAAAAGGA AAAAAAAAAA AAA 2003 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 22: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 11 is unknown, 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Ala Ser Leu Ala Asp Glu Tyr Glu Tyr Met Xaa Lys 
1 5 10 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 23: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 9 is unknown, 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Thr Glu Thr Ser Ser Ser Gly Leu Xaa Leu Lys 
15 10 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 24: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 



^la Ser Leu Ala Asp Glu Tyr Glu Tyr Met Arg Lys 
3 1 5 10 



1(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 25: 

\ (i) SEQUENCE CHARACTERISTICS: 

} (A) LENGTH: 9 

\ (B) TYPE: amino acid 

I (C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 7 is unknown, 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Ala Gly Tyr Phe Ala Glu Xaa Ala Arg 
1 5 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 



Thr Thr Glu Met Ala Ser Glu Gin Gly Ala 
15 10 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 



:=jAla Lys Glu Ala Leu Ala Ala Leu Lys 

y i 5 



1(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 

(B) TYPE: amino acid 
I (C) STRANDEDNESS: 

* (D) TOPOLOGY: linear 

: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 



_l Phe Val Leu Gin Ala Lys Lys 
1 5 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 29: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 



Glu Thr Gin Pro Asp Pro Gly Gin lie Leu Lys Lys Val Pro Met Val 
15 10 15 

lie Gly Ala Tyr Thr 
20 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 30: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
3 (ix) FEATURE: 

U (D) OTHER INFORMATION: Xaa in positions 1, 3, 17 and 19 is 

j unknown. 

h (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

*Xaa Glu Xaa Lys Glu Gly Arg Gly Lys Gly Lys Gly Lys Lys Lys Glu 
" 15 10 15 

=Xaa Gly Xaa Gly Lys 

f, 20 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 31: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Ala Glu Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu 
15 10 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 32: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 6 is unknown. 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Lys Leu Glu Phe Leu Xaa Ala Lys 
1 5 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 33: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

I (D) OTHER INFORMATION: Xaa in position 1 is Lysine or Arginine. 

j (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

=Xaa Val His Gin Val Trp Ala Ala Lys 
f 1 5 



0(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 34: 
K (i) SEQUENCE CHARACTERISTICS: 

N (A) LENGTH: 14 

*I3 (B) TYPE: amino acid 

m ( C ) STRANDEDNES S : 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 1 is Lysine or Arginine, 

Xaa in position 11 is unknown, 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Xaa Tyr lie Phe Phe Met Glu Pro Glu Ala Xaa Ser Ser Gly 
1 5 10 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 35: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ix) FEATURE: . . 

(D) OTHER INFORMATION: Xaa in position 1 is Lysine or Arginine, 

Xaa in position 13 is unknown. 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

Xaa Leu Gly Ala Trp Gly Pro Pro Ala Phe Pro Val Xaa Tyr 
15 10 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 36: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 1 is Lysine or Arginine. 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

Xaa Trp Phe Val Val lie Glu Gly Lys 
1 5 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 37: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 1 is Lysine or Arginine. 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

Xaa Ala Ser Pro Val Ser Val Gly Ser Val Gin Glu Leu Val Gin Arg 
15 10 15 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 38: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 1 is Lysine or Arginine 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

Xaa Val Cys Leu Leu Thr Val Ala Ala Leu Pro Pro Thr 
15 10 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 39: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

i (D) TOPOLOGY: linear 

\ (ix) FEATURE: 

"~ 4 (D) OTHER INFORMATION: Xaa in position 1 is Lysine or Arginine 

'£ Xaa in position 6 is unknown. 

^ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

-JXaa Asp Leu Leu Leu Xaa Val 
1 5 



R(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 40: 
n (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

Cys Thr Cys Gly Cys Cys Lys Cys Cys Arg Thr Thr Cys Ala Cys Arg 
1 5 10 15 

Cys Ala Gly Ala Ala Gly Gly Thr Cys Thr Thr Cys Thr Cys Cys Thr 
20 25 30 

Thr Cys Thr Cys Ala Gly Cys 
35 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 41: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 



Cys Cys Thr Cys Gly Cys Thr Cys Cys Thr Thr Cys Thr Thr Cys Thr 
1 5 10 15 

Thr Gly Cys Cys Cys Thr Thr Cys 
20 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 42: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 



Iaagtgcccaa ATGAGTTTAC TGGTGATCGC TGCCAAAACT acgtaatggc cagcttctac 



t(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 43: 

f (i) SEQUENCE CHARACTERISTICS: 

3 (A) LENGTH: 36 

1 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 



AGTACGTCCA CTCCCTTTCT GTCTCTGCCT GAATAG 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 44: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 569 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

AAGGCGGAGG AGCTGTACCA GAAGAGAGTG CTGACCATAA CCGGCATCTG CATCGCCCTC 60 

CTTGTGGTCG GCATCATGTG TGTGGTGGCC TACTGCAAAA CCAAGAAACA GCGGAAAAAG 120 

CTGCATGACC GTCTTCGGCA GAGCCTTCGG TCTGAACGAA ACAATATGAT GAACATTGCC 180 

AATGGGCCTC ACCATCCTAA CCCACCCCCC GAGAATGTCC AGCTGGTGAA TCAATACGTA 240 

TCTAAAAACG TCATCTCCAG TGAGCATATT GTTGAGAGAG AAGCAGAGAC ATCCTTTTCC 300 

ACCAGTCACT ATACTTCCAC AGCCCATCAC TCCACTACTG TCACCCAGAC TCCTAGCCAC 360 

AGCTGGAGCA ACGGACACAC TGAAAGCATC CTTTCCGAAA GCCACTCTGT AATCGTGATG 420 

TCATCCGTAG AAAACAGTAG GCACAGCAGC CCAACTGGGG GCCCAAGAGG ACGTCTTAAT 480 

GGCACAGGAG GCCCTCGTGA ATGTAACAGC TTCCTCAGGC ATGCCAGAGA AACCCCTGAT 540 

TCCTACCGAG ACTCTCCTCA TAGTGAAAG 569 



60(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 45: 

V 1 (i) SEQUENCE CHARACTERISTICS: 

Id (A) LENGTH: 8 

m (B) TYPE: amino acid 

q (C) STRANDEDNESS: 

~r, (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

ival His Gin Val Trp Ala Ala Lys 

ST 1 5 



[ (2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 46: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ix) FEATURE: . . 

(D) OTHER INFORMATION: Xaa in positxon 10 is unknown. 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

Tyr lie Phe Phe Met Glu Pro Glu Ala Xaa Ser Ser Gly 
1 5 10 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 47: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 12 is unknown, 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

Leu Gly Ala Trp Gly Pro Pro Ala Phe Pro Val Xaa Tyr 
1 5 10 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 48: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

PTrp Phe Val Val lie Glu Gly Lys 

1 1 5 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 49: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 



Ala Ser Pro Val Ser Val Gly. Ser Val Gin Glu Leu Val Gin Arg 
15 10 15 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 50: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

Val Cys Leu Leu Thr Val Ala Ala Leu Pro Pro Thr 
15 10 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 51: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 

(B) TYPE: amino acid 

( C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

rtys Val His Gin Val Trp Ala Ala Lys 

=1 1 5 



^(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 52: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 

(B) TYPE: amino acid 

( C ) STRANDEDNES S : 

(D) TOPOLOGY: linear 

(ix) FEATURE: . . . 

(D) OTHER INFORMATION: Xaa in position 12 is unknown. 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 



Lvs Ala Ser Leu Ala Asp Ser Gly Glu Tyr Met Xaa Lys 
1 5 10 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 53: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 5 is unknown, 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

Asp Leu Leu Leu Xaa Val 
1 5 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 54: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
I (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 



^TTYAARGGNG AYGCNCAYAC 20 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 55: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 



CATRTAYTCR TAYTCRTCNG C 21 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 56: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 



TGYTCNGANG CCATYTCNGT 20 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 57: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 



tlTGYTCRCTNG CCATYTCNGT 20 



h. (2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 58: 

u. (i) SEQUENCE CHARACTERISTICS: 

[ n (A) LENGTH: 20 

* = (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
£ (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 



§CCDATNACCA TNGGNACYTT 20 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 59: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 



GCNGCCCANA CYTGRTGNAC 20 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 



GCYTCNGGYT CCATRAARAA 20 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 



v'tCYTCDATNA CNACRAACCA 20 



H>(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 

vu (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 

|,* (B) TYPE: nucleic acid 

f-j (C) STRANDEDNESS: single 

m (D) TOPOLOGY: linear 

l" (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 



* TCNGCRAART ANCCNGC 17 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 



GCNGCNAGNG CYTCYTTNGC 20 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 



GCNGCYAANG CYTCYTTNGC 20 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 



OTTYTTNGCYT GNAGNACRAA 20 



S (2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER* 

lI (i) SEQUENCE CHARACTERISTICS: 

? ~ (A) LENGTH: 20 

(B) TYPE: nucleic acid 

? (C) STRANDEDNESS: single 

tl (D) TOPOLOGY: linear 

y (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 



,5 TTYTTNGCYT GYAANACRAA 20 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 



TGNACNAGYT CYTGNAC 17 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 



TGNACYAAYT CYTGNAC 17 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 



CATRTAYTCN CCNGARTCNG C 21 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 



CATRTAYTCN CCRCTRTCNG C 21 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 



NGARTCNGCY AANGANGCYT T 21 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 



NGARTCNGCN AGNGANGCYT T 21 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 



Grctrtcngcy AANGANGCYT T 21 



-(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 

h (i) SEQUENCE CHARACTERISTICS: 

U (A) LENGTH: 21 

\& (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

:\ (D) TOPOLOGY: linear 

E (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 



%CTRTCNGCN AGNGANGCYT T 21 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 



NGARTCNGCY AARCTNGCYT T 21 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 76: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 



NGARTCNGCN AGRCTNGCYT T 21 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 77: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 730 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 



IGTATGTGTCA GCCATGACCA CCCCGGCTCG TATGTCACCT GTAGATTTCC ACACGCCAAG 
JCTCCCCCAAA TCGCCCCCTT CGGAAATGTC TCCACCCGTG TCCAGCATGA CGGTGTCCAT 
1GCCTTCCATG GCGGTCAGCC CCTTCATGGA AGAAGAGAGA CCTCTACTTC TCGTGACACC 
JACCAAGGCTG CGGGAGAAGA AGTTTGACCA TCACCCTCAG CAGTTCAGCT CCTTCCACCA 
-CAACCCCGCG CATGACAGTA ACAGCCTCCC TGCTAGCCCC TTGAGGATAG TGGAGGATGA 
iGGAGTATGAA ACGACCCAAG agtacgagcc AGCCCAAGAG CCTGTTAAGA AACTCGCCAA 
TAGCCGGCGG GCCAAAAGAA CCAAGCCCAA TGGCCACATT GCTAACAGAT TGGAAGTGGA 
CAGCAACACA AGCTCCCAGA GCAGTAACTC AGAGAGTGAA ACAGAAGATG AAAGAGTAGG 
= TGAAGATACG CCTTTCCTGG GCATACAGAA CCCCCTGGCA GCCAGTCTTG AGGCAACACC 
sTGCCTTCCGC CTGGCTGACA GCAGGACTAA CCCAGCAGGC CGCTTCTCGA CACAGGAAGA 
Saatccaggcc AGGCTGTCTA GTGTAATTGC TAACCAAGAC CCTATTGCTG tataaaacct 
iAAATAAACAC ATAGATTCAC CTGTAAAACT TTATTTTATA TAATAAAGTA TTCCACCTTA 
AATTAAACAA 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 78: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 



RCTRTCNGCY AARCTNGCYT T 21 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 



RCTRCTNGCN AGRCTNGCYT T 21 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 



; jACN ACNGARA TGGCTCNNGA 20 



^(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 

= (i) SEQUENCE CHARACTERISTICS: 

M (A) LENGTH: 20 

Q (B) TYPE: nucleic acid 

lU (C) STRANDEDNESS: single 

7u (D) TOPOLOGY: linear 

»3 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 



ACNACNGARA TGGCAGYNGA 20 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
'(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 



CAYCARGTNT GGGCNGCNAA 20 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 



TTYGTNGTNA THGARGGNAA 20 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 



"AARGGNGAYG CNCAYACNGA 20 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 



GARGCNYTNG CNGCNYTNAA 20 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 



GTNGGNTCNG TNCARGARYT 20 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 87: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 



GTNGGNAGYG TNCARGARYT 20 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 88: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 



' jNACYTT YTTN ARDATYTGNC C 21 



1(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 89: 
y (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 417 
* (B) TYPE: nucleic acid 

- (C) STRANDEDNESS: single 

- (D) TOPOLOGY: linear 
L : . (ix) FEATURE: 

3 (D) OTHER INFORMATION: Xaa in positions 14, 23, 90, 100, 126, 

H and 135 is a stop codon. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 

TCTAA AAC TAC AGA GAC TGT ATT TTC ATG ATC ATC ATA GTT CTG TGA AAT ATA 53 
Asn Tyr Arg Asp Cys lie Phe Met lie lie lie Val Leu Xaa Asn lie 
15 10 15 

CTT AAA CCG CTT TGG TCC TGA TCT TGT AGG AAG TCA GAA CTT CGC ATT 101 
Leu Lys Pro Leu Trp Ser Xaa Ser Cys Arg Lys Ser Glu Leu Arg lie 
20 25 30 

AGC AAA GCG TCA CTG GCT GAT TCT GGA GAA TAT ATG TGC AAA GTG ATC 149 
Ser Lys Ala Ser Leu Ala Asp Ser Gly Glu Ser Met Cys Lys Val lie 
35 40 45 
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AGC AAA CTA GGA AAT GAC AGT GCC TCT GCC AAC ATC ACC ATT GTG GAG 197 
Ser Lys Leu Gly Asn Asp Ser Ala Ser Ala Asn lie Arg lie Val Glu 
50 55 60 

TCA AAC GGT AAG AGA TGC CTA CTG CGT GCT ATT TCT CAG TCT CTA AGA 245 
Ser Asn Gly Lys Arg Cys Leu Leu Arg Ala lie Ser Gin Ser Leu Arg 
€5 70 75 80 

GGA GTG ATC AAG GTA TGT GGT CAC ACT TGA ATC ACG CAG GTG TGT GAA 293 
Gly Val lie Lys Val Cys Gly His Thr Xaa lie Thr Gin Val Cys Glu 

85 90 95 

ATC TCA TTG TGA ACA AAT AAA AAT CAT GAA AGG AAA ACT CTA TGT TTG 341 
lie Ser Cys Xaa Thr Asn Lys Asn His Glu Arg Lys Thr Leu Cys Leu 
100 105 110 

AAA TAT CTT ATG GGT CCT CCT GTA AAG CTC TTC ACT CCA TAA GGT GAA 389 
Lys Tyr Leu Met Gly Pro Pro Val Lys Leu Phe Thr Pro Xaa Gly Glu 
115 120 125 

ATA GAC CTG AAA TAT ATA TAG ATT ATT T 417 
lie Asp Leu Lys Tyr lie Xaa lie lie 
130 135 



=^(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 90: 
T ; (i) SEQUENCE CHARACTERISTICS: 

3 (A) LENGTH: 3 3 

(B) TYPE: nucleic acid 
3 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
ri (ix) FEATURE: 

(D) OTHER INFORMATION: N at positions 19, 25, and 31 is Inosine. 
=H Y can be cytidine or thymidine. 

1j (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 

J CCGAATTCTG CAGGARACNC ARCCNGAYCC NGG 33 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 91: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: N at positions 14, 20, 23, 29, and 35 is 

Inosine. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 
AAGGATCCTG CAGNGTRTAN GCNCCDATNA CCATNGG 37 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 92: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
3 (D) TOPOLOGY: linear 

33 (ix) FEATURE: 

\f (D) OTHER INFORMATION: N at positions 16, 21, and 24 is Inosine. 

M Y can be cytidine or thymidine. 

Jti (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 

QCCGAATTCTG CAGGCNGAYT CNGGNGARTA YATG 34 



^(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 93: 
% (i) SEQUENCE CHARACTERISTICS: 

f? (A) LENGTH: 33 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: N at positions 16 and 25 is Inosine. Y 

can be cytidine or thymidine, 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 

CCGAATTCTG CAGGCNGAYA GYGGNGARTA YAT 33 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 94: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: N at positions 14, 15, 16, 26, and 29 is 

Inosine. Y can be cytidine or thymidine. 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 



AAGGATCCTG CAGNNNCATR TAYTCNCCNG ARTC 34 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 95: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 
(ix) FEATURE: 

(D) OTHER INFORMATION: N at positions 14, 15, 16, and 26 is 

Inosine. Y can be cytidine or thymidine, 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 



^AAGGATCCTG CAGNNNCATR TAYTCNCCRC TRTC 34 



J (2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 96: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: N at positions 21, 28, and 31 is Inosine. 

Y can be cytidine or thymidine, 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 



CCGAATTCTG CAGCAYCARG TNTGGGCNGC NAA 33 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 97: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: N at position 31 is Inosine. Y can be 

cytidine or thymidine, 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 

CCGAATTCTG CAGATHTTYT TYATGGARCC NGARG 35 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 98: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
« (D) TOPOLOGY: linear 

1 (ix) FEATURE: 

* (D) OTHER INFORMATION: N at positions 18, 21, 24, 27, and 33 is 

f Inosine. Y can be cytidine or thymidine. 

} (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 

■CCGAATTCTG CAGGGGGNCC NCCNGCNTTY CCNGT 35 



1(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 99: 
J (i) SEQUENCE CHARACTERISTICS: 

\ (A) LENGTH: 33 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ix) FEATURE: ^ 

(D) OTHER INFORMATION: N at positions 21 and 24 is Inosine. Y 

• can be cytidine or thymidine. 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 

CCGAATTCTG CAGTGGTTYG TNGTNATHGA RGG 33 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 100: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: N at positions 17, 20 , and 26 is Inosine, 

Y can be cytidine or thymidine, 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 

AAGGATCCTG CAGYTTNGCU NGCCCANACY TGRTG 35 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 101: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: N at position 19 is Inosine. Y can be 

cytidine or thymidine, 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 

AAGGATCCTG CAGGCYTCNG GYTCCATRAA RAA 33 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 102: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: N at positions 16, 22, 25, 28, and 31 is 

Inosine. 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 
AAGGATCCTG CAGACNGGRA ANGCNGGNGG NCC 33 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 103: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: N at positions 17, 26, and 29 is Inosine. 

Y can be cytidine or thymidine, 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 

AAGGATCCTG CAGYTTNCCY TCDATNACNA CRAAC 35 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 104: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: N at position 18 is Inosine. Y can be 

cytidine or thymidine. 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 

1 CATRTAYTCR TAYTCTCNGC AAGGATCCTG CAG 33 



'(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 105: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: N at position 19, 25, and 31 is Inosine, 

Y can be cytidine or thymidine, 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 

CCGAATTCTG CAGAARGGNG AYGCNCAYAC NGA 33 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 106: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: N at position 3 and 18 is Inosine. Y can 

be cytidine or thymidine, 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 



GCNGCYAANG CYTCYTTNGC AAGGATCCTG CAG 33 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 107: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: N at position 3, 6, 9, and 18 is Inosine. 

Y can be cytidine or thymidine, 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 



GCNGCNAGNG CYTCYTTNGC AAGGATCCTG CAG 33 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 108: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: N at position 3, 12, and 15 is Inosine. Y 

can be cytidine or thymidine. 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 



TCNGCRAART ANCCNGCAAG GATCCTGCAG 30 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 



CATCGATCTG CAGGCTGATT CTGGAGAATA TATGTGCA 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 



AAGGATCCTG CAGCCACATC TCGAGTCGAC ATCGATT 



*(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 



iCCGAATTCTG CAGTGATCAG CAAACTAGGA AATGACA 3 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 

CATCGATCTG CAGCCTAGTT TGCTGATCAC TTTGCAC 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 



AAGGATCCTG CAGTATATTC TCCAGAATCA GCCAGTG 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 



; AAGGATCCTG CAGGCACGCA GTAGGCATCT CTTA 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 



CCGAATTCTG CAGCAGAACT TCGCATTAGC AAAGC 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 



CATCCCGGGA TGAAGAGTCA GGAGTCTGTG GCA 33 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 



ATACCCGGGC TGCAGACAAT GAGATTTCAC ACACCTGCG 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 



AAGGATCCTG CAGTTTGGAA CCTGCCACAG ACTCCT 



^(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 



wATACCCGGGC TGCAGATGAG ATTTCACACA CCTGCGTGA 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120: 



His Gin Val Trp Ala Ala Lys Ala Ala Gly Leu Lys 
1 5 10 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 121: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121: 



Gly Gly Leu Lys Lys Asp Ser Leu Leu Thr Val Arg Leu Gly Ala Asn 
15 10 15 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 122: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 12 is unknown. 
3 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122: 

JYLevL Gly Ala Trp Gly Pro Pro Ala Phe Pro Val Xaa Tyr 
« 1 5 10 



; (2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 123: 

(i) SEQUENCE CHARACTERISTICS: 
\ (A) LENGTH: 23 

* (B) TYPE: amino acid 

I (C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
J (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123: 

Leu Leu Thr Val Arg Leu Gly Ala Trp Gly His Pro Ala Phe Pro Ser 
1 5s 10 15 

Cys Gly Arg Leu Lys Glu Asp 
20 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 124: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 10 is unknown, 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124: 

Tyr lie Phe Phe Met Glu Pro Glu Ala Xaa Ser Ser Gly 
15 10 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 125: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125: 

jLys Glu Asp Ser Arg Tyr lie Phe Phe Met Glu Pro Glu Ala Asn Ser 

= j 1 5 10 15 

iSer Gly Gly Pro Gly Arg Leu 

"■f 20 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 126: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126: 



Val Ala Gly Ser Lys Leu Val Leu Arg Cys Glu Thr Ser Ser 
15 10 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 127: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127: 

Glu Tyr Lys Cys Leu Lys Phe Lys Trp Phe Lys Lys Ala Thr Val Met 
15 10 15 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 128: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128: 

Cys Glu Thr Ser Ser Glu Tyr Ser Ser Leu Lys Phe Lys Trp Phe Lys 

5 1 5 10 15 

%iAsn Gly Ser Glu Leu Ser Arg Lys Asn Lys 
jJ 20 25 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 129: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 12 is unknown, 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129: 



Lys Ala Ser Leu Ala Asp Ser Gly Glu Tyr Met Xaa Lys 
15 10 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 130: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130: 

Glu Leu Arg lie Ser Lys Ala Ser Leu Ala Asp Ser Gly Glu Tyr Met 
1 5 10 15 

Cys Lys Val lie Ser Lys Leu 
20 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 131: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

C (D) TOPOLOGY: linear 

m (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 

:^Ala Ser Leu Ala Asp Glu Tyr Glu Tyr Met Arg Lys 
« 1 5 10 



'(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 132: 
1 (i) SEQUENCE CHARACTERISTICS: 

J (A) LENGTH: 22 

j (B) TYPE: amino acid 

j (C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132: 

Leu Arg lie Ser Lys Ala Ser Leu Ala Asp Ser Gly Glu Tyr Met Cys 
15 10 15 

Lys Val lie Ser Lys Leu 
20 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 133: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 744 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133: 



CCTGCAG CAT CAA GTG TGG GCG GCG AAA GCC GGG GGC TTG AAG AAG GAC TCG CTG 55 
His Gin Val Trp Ala Ala Lys Ala Gly Gly Leu Lys Lys Asp Ser Leu 
15 10 15 
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ATC ACC ATT GTG GAG TCA AAC GGT AAG AGA TGC CTA CTG CGT GCT ATT 583 
lie Thr lie Val Glu Ser Asn Gly Lys Arg Cys Leu Leu Arg Ala lie 
180 185 190 

TCT CAG TCT CTA AGA GGA GTG ATC AAG GTA TGT GGT CAC ACT 625 
Ser Gin Ser Leu Arg Gly Val lie Lys Val Cys Gly His Thr 
195 200 205 

TGAATCACGC AGGTGTGTGA AATCTCATTG TGAACAAATA AAAATCATGA AAGGAAAAAA 685 
AAAAAAAAAA AATCGATGTC GACTCGAGAT GTGGCTGCAG GTCGACTCTA GAGGATCCC 744 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 134: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1193 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134: 



CfCCTGCAG CAT CAA GTG TGG GCG GCG AAA GCC GGG GGC TTG AAG AAG GAC TCG CTG 55 
DC His Gin Val Trp Ala Ala Lys Ala Gly Gly Leu Lys Lys Asp Ser Leu 

V 1 5 10 15 

ffCTC ACC GTG CGC CTG GGC GCC TGG GGC CAC CCC GCC TTC CCC TCC TGC 103 

uLeu Thr Val Arg Leu Gly Ala Trp Gly His Pro Ala Phe Pro Ser Cys 

1=1 20 25 30 

r'GGG CGC CTC AAG GAG GAC AGC AGG TAC ATC TTC TTC ATG GAG CCC GAG 151 

u.Gly Arg Leu Lys Glu Asp Ser Arg Tyr lie Phe Phe Met Glu Pro Glu 

O 35 40 45 

L-nGCC AAC AGC AGC GGC GGG CCC GGC CGC CTT CCG AGC CTC CTT CCC CCC 199 

; ^Ala Lys Ser Ser Gly Gly Pro Gly Arg Leu Pro Ser Leu Leu Pro Pro 

J:' 50 55 60 

TCT CGA GAC GGG CCG GAA CCT CAA GAA GGA GGT CAG CCG GGT GCT GTG 247 
Ser Arg Asp Gly Pro Glu Pro Gin Glu Gly Gly Gin Pro Gly Ala Val 
65 70 . 75 80 

CAA CGG TGC GCC TTG CCT CCC CGC TTG AAA GAG ATG AAG AGT CAG GAG 295 
Gin Arg Cys Ala Leu Pro Pro Arg Leu Lys Glu Met Lys Ser Gin Glu 

85 90 95 

TCT GTG GCA GGT TCC AAA CTA GTG CTT CGG TGC GAG ACC AGT TCT GAA 343 

Ser Val Ala Gly Ser Lys Leu Val Leu Arg Cys Glu Thr Ser Ser Glu 
100 105 110 
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TAC TCC TCT CTC AAG TTC AAG TGG TTC AAG AAT GGG AGT GAA TTA AGC 
Tyr Ser Ser Leu Lys Phe Lys Trp Phe Lys Asn Gly Ser Glu Leu Ser 
115 120 125 



391 



CGA AAG AAC AAA CCA GAA AAC ATC AAG ATA CAG AAA AGG CCG GGG AAG 439 
Arg Lys Asn Lys Gly Gly Asn lie Lys lie Gin Lys Arg Pro Gly Lys 
130 135 140 

TCA GAA CTT CGC ATT AGC AAA GCG TCA CTG GCT GAT TCT GGA GAA TAT 487 
Ser Glu Leu Arg lie Ser Lys Ala Ser Leu Ala Asp Ser Gly Glu Tyr 
145 150 155 160 

ATG TGC AAA GTG ATC AGC AAA CTA GGA AAT GAC AGT GCC TCT GCC AAC 535 
Met Cys Lys Val lie Ser Lys Leu Gly Asn Asp Ser Ala Ser Ala Asn 

165 170 175 

ATC ACC ATT GTG GAG TCA AAC GCC ACA TCC ACA TCT ACA GCT GGG ACA 583 
lie Thr lie Val Glu Ser Asn Ala Thr Ser Thr Ser Thr Ala Gly Thr 

180 185 190 

AGC CAT CTT GTC AAG TGT GCA GAG AAG GAG AAA ACT TTC TGT GTG AAT 631 
Ser His Leu Val Lys Ser Ala Glu Lys Glu Lys Thr Phe Cys Val Asn 
195 200 205 

%GA GGC GAG TGC TTC ATG GTG AAA GAC CTT TCA AAT CCC TCA AGA TAC 679 
%ly Gly Glu Cys Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr . 
J 210 215 220 

3TTG TGC AAG TGC CAA CCT GGA TTC ACT GGA GCG AGA TGT ACT GAG AAT 727 
*Leu Cys Lys Cys Gin Pro Gly Phe Thr Gly Ala Arg Cys Thr Glu Asn 

13225 230 235 240 

!a6T6 CCC ATG AAA GTC CAA ACC CAA GAA AGT GCC CAA ATG AGT TTA CTG 775 
=Val Pro Met Lys Val Gin Thr Gin Glu Ser Ala Gin Met Ser Leu Leu 

lj 245 250 255 

«GTG ATC GCT GCC AAA ACT ACG TAATGGCCAG CTTCTACAGT ACGTCCACTC 826 
«Val lie Ala Ala Lys Thr Thr 

* 260 

CCTTTCTGTC TCTGCCTGAA TAGCGCATCT CAGTCGGTGC CGCTTTCTTG TTGCCGCATC 886 

TCCCCTCAGA TTCCTCCTAG AGCTAGATGC GTTTTACCAG GTCTAACATT GACTGCCTCT 946 

GCCTGTCGCA TGAGAACATT AACACAAGCG ATTGTATGAC TTCCTCTGTC CGTGACTAGT 1006 

GGGCTCTGAG CTACTCGTAG GTGCGTAAGG CTCCAGTGTT TCTGAAATTG ATCTTGAATT 1066 

ACTGTGATAC GACATGATAG TCCCTCTCAC CCAGTGCAAT GACAATAAAG GCCTTGAAAA 1126 

GTCAAAAAAA AAAAAAAAAA AAAAAATCGA TGTCGACTCG AGATGTGGCT GCAGGTCGAC 1186 

TCTAGAG 1193 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 135: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1108 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135: 



CCTGCAG CAT CAA GTG TGG GCG GCG AAA GCC GGG GGC TTG AAG AAG GAC TCG CTG 55 

His Gin Val Trp Ala Ala Lys Ala Gly Gly Leu Lys Lys Asp ser Leu 
15 10 15 

CTC ACC GTG CGC CTG GGC GCC TGG GGC CAC CCC GCC TTC CCC TCC TGC 103 
Leu Thr Val Arg Leu Gly Ala Trp Gly His Pro Ala Phe Pro Ser Cys 
20 25 30 

GGG CGC CTC AAG GAG GAC AGC AGG TAC ATC TTC TTC ATG GAG CCC GAG 151 
Gly Arg Leu Lys Glu Asp Ser Arg Tyr lie Phe Phe Met Glu Pro Glu 
35 40 45 

GCC AAC AGC AGC GGC GGG CCC GGC CGC CTT CCG AGC CTC CTT CCC CCC 199 
Ala Asn Ser Ser Gly Gly Pro Gly Arg Leu Pro Ser Leu Leu Pro Pro 

5 50 55 60 

JTCT CGA GAC GGG CCG GAA CCT CAA GAA GGA GGT CAG CCG GGT GCT GTG 247 
?JSer Arg Asp Gly Pro Glu Pro Gin Glu Gly Gly Gin Pro Gly Ala Val 

f,65 70 75 80 

I CAA CGG TGC GCC TTG CCT CCC CGC TTG AAA GAG ATG AAG AGT CAG GAG 295 
nGln Arg Cys Ala Leu Pro Pro Arg Leu Lys Glu Met Lys Ser Gin Glu 

85 90 95 

"TCT GTG GCA GGT TCC AAA CTA GTG CTT CGG TGC GAG ACC AGT TCT GAA 343 
"Ser Val Ala Gly Ser Lys Leu Val Leu Arg Cys Glu Thr Ser Ser Glu 
jj 100 105 110 

S TAC TCC TCT CTC AAG TTC AAG TGG TTC AAG AAT GGG AGT GAA TTA AGC 391 
F Tyr Ser Ser Leu Lys Phe Lys Trp Phe Lys Asn Gly Ser Glu Leu Ser 
115 120 125 

CGA AAG AAC AAA CCA GAA AAC ATC AAG ATA CAG AAA AGG CCG GGG AAG 439 
Arg Lys Asn Lys Pro Glu Asn lie Lys lie Gin Lys Arg Pro Pro Lys 
130 135 140 

TCA GAA CTT CGC ATT AGC AAA GCG TCA CTG GCT GAT TCT GGA GAA TAT 487 
Ser Glu Leu Arg lie Ser Lys Ala ser Leu Ala Asp Ser Gly Glu Tyr 
145 150 155 160 

ATG TGC AAA GTG ATC AGC AAA CTA GGA AAT GAC AGT GCC TCT GCC AAC 535 
Met Cys Lys Val lie Ser Lys Leu Gly Asn Asp Ser Ala Ser Ala Asn 

165 170 175 
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ATC ACC ATT GTG GAG TCA AAC GCC ACA TCC ACA TCT ACA GCT GGG ACA 583 
lie Arg lie Val Glu Ser Asn Ala Thr Ser Thr Ser Thr Ala Gly Thr 
180 185 190 

AGC CAT CTT GTC AAG TGT GCA GAG AAG GAG AAA ACT TTC TGT GTG AAT 631 
Ser His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn 
195 200 205 

GGA GGC GAG TGC TTC ATG GTG AAA GAC CTT TCA AAT CCC TCA AGA TAC 679 
Gly Gly Glu Cys Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr 
210 215 220 

TTG TGC AAG TGC CCA AAT GAG TTT ACT GGT GAT CGC TGC CAA AAC TAC 727 
Leu Cys Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin Asn Tyr 
225 230 235 240 

GTA ATG GCC AGC TTC TAC AGT ACG TCC ACT CCC TTT CTG TCT CTG CCT 775 
Val Met Ala Ser Phe Tyr Ser Thr Ser Thr Pro Phe Leu Ser Leu Pro 

245 250 255 

GAA TAGCGCATCT CAGTCGGTGC CGCTTTCTTG TTGCCGCATC TCCCCTCAGA TTCCGCCTAG 838 
Glu 



□AGCTAGATGC GTTTTACCAG GTCTAACATT GACTGCCTCT GCCTGTCGCA TGAGAACATT 898 

IBAACACAAGCG ATTGTATGAC TTCCTCTGTC CGTGACTAGT GGGCTCTGAG CTACTCGTAG 958 

SiGTGCGTAAGG CTCCAGTGTT TCTGAAATTG ATCTTGAATT ACTGTGATAC GACATGATAG 1018 

yTCCCTCTCAC CCAGTGCAAT GACAATAAAG GCCTTGAAAA GTCAAAAAAA AAAAAAAAAA 1078 

SAAAAATCGAT GTCGACTCGA GATGTGGCTG 1108 



""(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 136: 
?. (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 559 

(B) TYPE: nucleic acid 

LH (C) STRANDEDNESS: single 

^ (D) TOPOLOGY: linear 

= ^f (ix) FEATURE: 

51 (D) OTHER INFORMATION: N in position 214 is unknown, 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 6: 



AGTTTCCCCC CCCAACTTGT CGGAACTCTG GGCTCGCGCG CAGGGCAGGA GCGGAGCGGC 60 

GGCGGCTGCC CAGGCGATGC GAGCGCGGGC CGGACGGTAA TCGCCTCTCC CTCCTCGGGC 120 

TGCGAGCGCG CCGGACCGAG GCAGCGACAG GAGCGGACCG CGGCGGGAAC CGAGGACTCC 180 

CCAGCGGCGC GCCAGCAGGA GCCACCCCGC GAGNCGTGCG ACCGGGACGG AGCGCCCGCC 24 0 

AGTCCCAGGT GGCCCGGACC GCACGTTGCG TCCCCGCGCT CCCCGCCGGC GACAGGAGAC 300 

GCTCCCCCCC ACGCCGCGCG CGCCTCGGCC CGGTCGCTGG CCCGCCTCCA CTCCGGGGAC 360 

AAACTTTTCC CGAAGCCGAT CCCAGCCCTC GGACCCAAAC TTGTCGCGCG TCGCCTTCGC 420 

CGGGAGCCGT CCGCGCAGAG CGTGCACTTC TCGGGCGAG ATG TCG GAG CGC AGA 474 

Met Ser Glu Arg Arg 
1 5 
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GAA GGC AAA GGC AAG GGG AAG GGC GGC AAG AAG GAC CGA GGC TCC GGG 522 
Glu Gly Lys Gly Lys Gly Lys Gly Gly Lys Lys Asp Arg Gly Ser Gly 

10 15 20 

AAG AAG CCC GTG CCC GCG GCT GGC GGC CCG AGC CCA G 559 
Lys Lys Pro Val Pro Ala Ala Gly Gly Pro Ser Pro Ala 
25 30 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 137: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 252 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: N in position 8 could be either A or G. 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137: 



CC CAT CAN GTG TGG GCG GCG AAA GCC GGG GGC TTG AAG AAG GAC TCG 
His Gin Val Trp Ala Ala Lys Ala Gly Gly Leu Lys Lys Asp Ser 
1 5 10 15 

j CTG CTC ACC GTG CGC CTG GGC GCC TGG GGC CAC CCC GCC TTC CCC TCC 
j Leu Leu Thr Val Arg Leu Gly Ala Trp Gly His Pro Ala Phe Pro Ser 

20 25 30 

TGC GGG CGC CTC AAG GAG GAC AGC AGG TAC ATC TTC TTC ATG GAG CCC 
?Cys Gly Arg Leu Lys Glu Asp Ser Arg Tyr lie Phe Phe Met Glu Pro 
35 40 45 

I GAG GCC AAC AGC AGC GGC GGG CCC GGC CGC CTT CCG AGC CTC CTT CCC 
i Glu Ala Asn Ser Ser Gly Gly Pro Gly Arg Leu Pro Ser Leu Leu Pro 
50 55 60 

! CCC TCT CGA GAC GGG CCG GAA CCT CAA GAA GGA GGT CAG CCG GGT GCT 
Pro Ser Arg Asp Gly Pro Glu Pro Gin Glu Gly Gly Gin Pro Gly Ala 
65 70 75 

GTG CAA CGG TGC G 
Val Gin Arg Cys 
80 



47 



95 



143 



191 



239 



252 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 138: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 178 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138: 



CCT TGC CTC CCC GCT TGA AAG AGA TGA AGA GTC AGG AGT CTG TGG CAG 
Leu Pro Pro Arg Leu Lys Glu His Lys Ser Gin Glu Ser Val Ala Gly 
15 10 15 

GTT CCA AAC TAG TGC TTC GGT GCG AGA CCA GTT CTG AAT ACT CCT CTC 
Ser Lys Leu Val Leu Arg Cys Glu Thr Ser Ser Glu Tyr Ser Ser Leu 
20 25 30 

TCA AGT TCA AGT GGT TCA AGA ATG GGA GTG AAT TAA GCC GAA AGA ACA 
Lys Phe Lys Trp Phe Lys Asn Gly Ser Glu Leu Ser Arg Lys Asn Lys 
35 40 45 

AAC CAC AAA ACA TCA AGA TAC AGA AAA GGC CGG G 
Pro Gly Asn lie Lys lie Gin Lys Arg Pro Gly 
- 50 55 



3(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 139: 
(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 122 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139: 



[fG AAG TCA GAA CTT CGC ATT AGC AAA GCG TCA CTG GCT GAT TCT GGA 
Lys Ser Glu Leu Arg lie Ser Lys Ala Ser Leu Ala Asp Ser Gly 
l 5 10 15 

GAA TAT ATG TGC AAA GTG ATC AGC AAA CTA GGA AAT GAC AGT GCC TCT 
Glu Tyr Met Cys Lys Val lie Ser Lys Leu Gly Asn Asp Ser Ala Ser 

20 25 30 

GCC AAC ATC ACC ATT GTG GAG TCA AAC G 
Ala Asn lie Thr lie Val Glu Ser Asn Ala 
35 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 140: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 417 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140: 

TCTAAAACTA CAGAGACTGT ATTTTCATGA TCATCATAGT TCTGTGAAAT ATACTTAAAC 
CGCTTTGGTC CTGATCTTGT AGG AAG TCA GAA CTT CGC ATT AGC AAA GCG 

Lys Ser Glu Leu Arg lie Ser Lys Ala 
1 5 

TCA CTG GCT GAT TCT GGA GAA TAT ATG TGC AAA GTG ATC AGC AAA CTA 
Ser Leu Ala Asp Ser Gly Glu Tyr Met Cys Lys Val lie Ser Lys Leu 
10 15 20 25 

GGA AAT GAC AGT GCC TCT GCC AAC ATC ACC ATT GTG GAG TCA AAC GGT 
Gly Asn Asp Ser Ala Ser Ala Asn He Thr He Val Glu Ser Asn Gly 

30 35 40 

AAG AGA TGC CTA CTG CGT GCT ATT TCT CAG TCT CTA AGA GGA GTG ATC 
.Lys Arg Cys Leu Leu Arg Ala He Ser Gin Ser Leu Arg Gly Val He 
D 45 50 55 

; ^AAG GTA TGT GGT CAC ACT TGAATCACGC AGGTGTGTGA AATCTCATTG 
WLys Val Cys Gly His Thr 

yl 60 

^TGAACAAATA AAAATCATGA AAGGAAAACT CTATGTTTGA AATATCTTAT GGGTCCTCCT 
£ GTAAAGCTCT TCACTCCATA AGGTGAAATA GACCTGAAAT ATATATAGAT TATTT 



% (2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 141: 
«j (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 
2 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear . 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141: 

AG ATC ACC ACT GGC ATG CCA GCC TCA ACT GAG ACA GCG TAT GTG TCT 
Glu He Thr Thr Gly Met Pro Ala Ser Thr Glu Thr Ala Tyr Val Ser 
15 10 15 

TCA GAG TCT CCC ATT AGA ATA TCA GTA TCA ACA GAA GGA ACA AAT ACT 
Ser Glu Ser Pro He Arg He Ser Val Ser Thr Glu Gly Thr Asn Thr 
20 25 30 

TCT TCA T 
Ser Ser Ser 
35 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 142: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142: 



AAG TGC CAA CCT GGA TTC ACT GGA GCG AGA TGT ACT GAG AAT GTG CCC 
Lys Cys Gin Pro Gly Phe Thr Gly Ala Arg Cys Thr Glu Asn Val Pro 
15 10 15 



48 



ATG AAA GTC CAA ACC CAA GAA 
Met Lys Val Gin Thr Gin Glu 
20 



69 



INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 143: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143: 

MAAG TGC CCA AAT GAG TTT ACT GGT GAT CGC TGC CAA AAC TAC GTA ATG 48 
y3Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin Asn Tyr Val Met 
7 15 10 15 

QGCC AGC TTC TAC 60 

ruAla Ser Phe Tyr 
m 20 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 144: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: §6 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144: 

AGT ACG TCC ACT CCC TTT CTG TCT CTG CCT GAA TAG 36 
Ser Thr Ser Thr Pro Phe Leu Ser Leu Pro Glu 
1 5 10 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 145: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145: 



AAG CAT CTT GGG ATT GAA TTT ATG GAG 27 
Lys His Leu Gly lie Glu Phe Met Glu 
1 5 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 146: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 569 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146: 

WAAA GCG GAG GAG CTC TAC CAG AAG AGA GTG CTC ACC ATT ACC GGC ATT 48 
ff^Lys Ala Glu Glu Leu Tyr Gin Lys Arg Val Leu Thr lie Thr Gly lie 
□ l 5 10 15 

yTGC ATC GCG CTG CTC GTG GTT GGC ATC ATG TGT GTG GTG GTC TAC TGC 96 
~ Cys lie Ala Leu Leu Val Val Gly lie Met Cys Val Val Val Tyr Cys 
\a 20 25 30 

7r AAA ACC AAG AAA CAA CGG AAA AAG CTT CAT GAC CGG CTT CGG CAG AGC 144 
hi Lys Thr Lys Lys Gin Arg Lys Lys Leu His Asp Arg Leu Arg Gin Ser 
[p 35 40 45 

"CTT CGG TCT GAA AGA AAC ACC ATG ATG AAC GTA GCC AAC GGG CCC CAC 192 
Leu Arg Ser Glu Arg Asn Thr Met Met Asn Val Ala Asn Gly Pro His 
50 55 60 

CAC CCC AAT CCG CCC CCC GAG AAC GTG CAG CTG GTG AAT CAA TAC GTA 240 
His Pro Asn Pro Pro Pro Glu Asn Val Gin Leu Val Asn Gin Tyr Val 
65 70 75 80 

TCT AAA AAT GTC ATC TCT AGC GAG CAT ATT GTT GAG AGA GAG GCG GAG 288 
Ser Lys Asn Val lie Ser Ser Glu His lie Val Glu Arg Glu Ala Glu 

85 90 95 

AGC TCT TTT TCC ACC AGT CAC TAC ACT TCG ACA GCT CAT CAT TCC ACT 336 
Ser Ser Phe Ser Thr Ser His Tyr Thr Ser Thr. Ala His His Ser Thr 
100 105 110 
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ACT GTC ACT CAG ACT CCC AGT CAC AGC 
Thr Val Thr Gin Thr Pro Ser His Ser 
115 120 

AGC ATC ATT TCG GAA AGC CAC TCT GTC 
Ser lie lie Ser Glu Ser His Ser Val 
130 135 

AAC AGT AGG CAC AGC AGC CCG ACT GGG 
Asn Ser Arg His Ser Ser Pro Thr Gly 
145 150 

GGC TTG GGA GGC CCT CGT GAA TGT AAC 

Gly Leu Gly Gly Pro Arg Glu Cys Asn 

165 

GAA ACC CCT GAC TCC TAC CGA GAC TCT 

Glu Thr Pro Asp Ser Tyr Arg Asp Ser 

180 185 



TGG AGC AAT GGA CAC ACT GAA 384 
Trp Ser Asn Gly His Thr Glu 
125 

ATC GTG ATG TCA TCC GTA GAA 432 
lie Val Met Ser Ser Val Glu 
140 

GGC CCG AGA GGA CGT CTC AAT 480 
Gly Pro Arg Gly Arg Leu Asn 

155 160 

AGC TTC CTC AGG CAT GCC AGA 528 
Ser Phe Leu Arg His Ala Arg 
170 175 

CCT CAT AGT G AAAG 569 
Pro His Ser 



i--(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 147: 

(i) SEQUENCE CHARACTERISTICS: 
* (A) LENGTH: 730 

(B) TYPE: nucleic acid 
;i (C) STRANDEDNESS : single 

;£_: (D) TOPOLOGY: linear 

H (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147: 

■ G TAT GTA TCA GCA ATG ACC ACC CCG GCT CGT ATG TCA CCT GTA GAT 46 
M Tyr Val Ser Ala Met Thr Thr Pro Ala Arg Met Ser Pro Val Asp 
Q 1 5 10 15 

fUTTC CAC ACG CCA AGC TCC CCC AAG TCA CCC CCT TCG GAA ATG TCC CCG 94 
iiPhe His Thr Pro Ser Ser Pro Lys Ser Pro Pro Ser Glu Met Ser Pro 
m 20 25 30 

CCC GTG TCC AGC ACG ACG GTC TCC ATG CCC TCC ATG GCG GTC AGT CCC 142 
Pro Val Ser Ser Thr Thr Val Ser Met Pro Ser Met Ala Val Ser Pro 
35 40 45 

TTC GTG GAA GAG GAG AGA CCC CTG CTC CTT GTG ACG CCA CCA CGG CTG 190 
Phe Val Glu Glu Glu Arg Pro Leu Leu Leu Val Thr Pro Pro Arg Leu 
50 55 60 

CGG GAG AAG TAT GAC CAC CAC GCC CAG CAA TTC AAC TCG TTC CAC TGC 238 
Arg Glu Lys Tyr Asp His His Ala Gin Gin Phe Asn Ser Phe His Cys 
65 70 75 
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AAC CCC GCG CAT GAG AGC AAC AGC CTG CCC CCC AGC CCC TTG AGG ATA 286 
Asn Pro Ala His Glu Ser Asn Ser Leu Pro Pro Ser Pro Leu Arg lie 
80 85 90 95 

GTG GAG GAT GAG GAA TAT GAA ACG ACC CAG GAG TAC GAA CCA GCT CAA 334 
Val Glu Asp Glu Glu Tyr Glu Thr Thr Gin Glu Tyr Glu Pro Ala Gin 

100 105 110 

GAG CCG GTT AAG AAA CTC ACC AAC AGC AGC CGG CGG GCC AAA AGA ACC 382 
Glu Pro Val Lys Lys Leu Thr Asn Ser Ser Arg Arg Ala Lys Arg Thr 
115 120 125 

AAG CCC AAT GGT CAC ATT GCC CAC AGG TTG GAA ATG GAC AAC AAC ACA 430 
Lys Pro Asn Gly His lie Ala His Arg Leu Glu Met Asp Asn Asn Thr 
130 135 140 

GGC GCT GAC AGC AGT AAC TCA GAG AGC GAA ACA GAG GAT GAA AGA GTA 478 
Gly Ala Asp Ser Ser Asn Ser Glu Ser Glu Thr Glu Asp Glu Arg Val 
145 150 155 

GGA GAA GAT ACG CCT TTC CTG GCC ATA CAG AAC CCC CTG GCA GCC AGT 526 
Gly Glu Asp Thr Pro Phe Leu Ala lie Gin Asn Pro Leu Ala Ala Ser 
160 165 170 175 

KCTC GAG GCG GCC CCT GCC TTC CGC CTG GTC GAC AGC AGG ACT AAC CCA 574 
SiLeu Glu Ala Ala Pro Ala Phe Arg Leu Val Asp Ser Arg Thr Asn Pro 
yj 180 185 190 

OACA GGC GGC TTC TCT CCG CAG GAA GAA TTG CAG GCC AGG CTC TCC GGT 622 
••iThr Gly Gly Phe Ser Pro Gin Glu Glu Leu Gin Ala Arg Leu Ser Gly 
ifl 195 200 205 

L.GTA ATC GCT AAC CAA GAC CCT ATC GCT GTC TAAAACCGAA ATACACCCAT 672 
Sval He Ala Asn Gin Asp Pro He Ala Val 

pr f 210 215 

; ijAGATTCACCT GTAAAACTTT ATTTTATATA ATAAAGTATT CCACCTTAAA TTAAACAA 730 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 148: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1652 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148: 



AGTTTCCCCC CCCAACTTGT CGGAACTCTG GGCTCGCGCG CAGGGCAGGA GCGGAGCGGC 60 

GGCGGCTGCC CAGGCGATGC GAGCGCGGGC CGGACGGTAA TCGCCTCTCC CTCCTCGGGC 120 

TGCGAGCGCG CCGGACCGAG GCAGCGACAG GAGCGGACCG CGGCGGGAAC CGAGGACTCC 180 

157 



CCAGCGGCGC GCCAGCAGGA GCCACCCCGC GAGCGTGCGA CCGGGACGGA GCGCCCGCCA 240 
GTCCCAGGTG GCCCGGACCG CACGTTGCGT CCCCGCGCTC CCCGCCGGCG ACAGGAGACG 300 
CTCCCCCCCA CGCCGCGCGC GCCTCGGCCC GGTCGCTGGC CCGCCTCCAC TCCGGGGACA 360 
AACTTTTCCC GAAGCCGATC CCAGCCCTCG GACCCAAACT TGTCGCGCGT CGCCTTCGCC 420 
GGGAGCCGTC CGCGCAGAGC GTGCACTTCT CGGGCGAG ATG TCG GAG CGC AGA 473 

Met Ser Glu Arg Arg 
1 5 

GAA GGC AAA GGC AAG GGG AAG GGC GGC AAG AAG GAC CGA GGC TCC GGG 521 
Glu Gly Lys Gly Lys Gly Lys Gly Gly Lys Lys Asp Arg Gly Ser Gly 

10 1 15 20 

AAG AAG CCC GTG CCC GCG GCT GGC GGC CCG AGC CCA GCC TTG CCT CCC 569 
Lys Lys Pro Val Pro Ala Ala Gly Gly Pro Ser Pro Ala Leu Pro Pro 

25 30 35 

CGC TTG AAA GAG ATG AAG ATG CAG GAG TCT GTG GCA GGT TCC AAA CTA 617 
Arg Leu Lys Glu Met Lys Ser Gin Glu Ser Val Ala Gly Ser Lys Leu 
40 45 50 

GTG CTT CGG TGC GAG ACC AGT TCT GAA TAC TCC TCT CTC AAG TTC AAG 665 
Val Leu Arg Cys Glu Thr Ser Ser Glu Tyr Ser Ser Leu Lys Phe Lys 
55 60 65 

STGG TTC AAG AAT GGG AGT GAA TTA AGC CGA AAG AAC AAA CCA CAA AAC 713 
liTrp Phe Lys Asn Gly Ser Gly Leu Ser Arg Lys Asn Lys Pro Gin Asn 
,] 70 75 80 85 

-ATC AAG ATA CAG AAA AGG CCG GGG AAG TCA GAA CTT CGC ATT AGC AAA 761 
tile Lys lie Gin Lys Arg Pro Gly Lys Ser Glu Leu Arg lie Ser Lys 
~ 90 95 100 

GCG TCA CTG GCT GAT TCT GGA GAA TAT ATG TGC AAA GTG ATC AGC AAA 809 
!!Ala Ser Leu Ala Asp Ser Gly Glu Tyr Met Cys Lys Val He Ser Lys 

f s 105 110 115 

UCTA GGA AAT GAC AGT GCC TCT GCC AAC ATC ACC ATT GTG GAG TCA AAC 857 
flLeu Gly Asn Asp Ser Ala Ser Ala Asn He Thr He Val Glu Ser Asn 
Ji 120 125 130 

GAG ATC ACC ACT GGC ATG CCA GCC TCA ACT GAG ACA GCG TAT GTG TCT 905 
Glu He Thr Thr Gly Met Pro Ala Ser Thr Glu Thr Ala Tyr Val Ser 
135 140 145 

TCA GAG TCT CCC ATT AGA ATA TCA GTA TCA ACA GAA GGA ACA AAT ACT 953 
Ser Glu Ser Pro He Arg He Ser Val Ser Thr Glu Gly Thr Asn Thr 
150 155 160 165 

TCT TCA TCC ACA TCC ACA TCT ACA GCT GGG ACA AGC CAT CTT GTC AAG 1001 
Ser Ser Ser Thr Ser Thr Ser Thr Ala Gly Thr Ser His Leu Val Lys 

170 175 180 
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TGT GCA GAG AAG GAG AAA ACT TTC TGT GTG AAT GGA GGC GAG TGC TTC 
Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu Cys Phe 
185 190 195 

ATG GTG AAA GAC CTT TCA AAT CCC TCA AGA TAC TTG TGC AAG TGC CCA 
Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr Leu Cys Lys Cys Pro 
200 205 210 

AAT GAG TTT ACT GGT GAT CGC TGC CAA AAC TAC GTA ATG GCC AGC TTC 
Asn Glu Phe Thr Gly Asp Arg Cys Gin Asn Tyr Val Met Ala Ser Phe 
215 220 225 

TAC AGT ACG TCC ACT CCC TTT CTG TCT CTG CCT GAA TAGGCGCATG 
Tyr Ser Thr Ser Thr Pro Phe Leu Ser Leu Pro Glu 
230 235 240 

CTCAGTCGGT GCCGCTTTCT TGTTGCCGCA TCTCCCCTCA GATTCAACCT AGAGCTAGAT 
GCGTTTTACC AGGTCTAACA TTGACTGCCT CTGCCTGTCG CATGAGAACA TTAACACAAG 
CGATTGTATG ACTTCCTCTG TCCGTGACTA GTGGGCTCTG AGCTACTCGT AGGTGCGTAA 
GGCTCCAGTG TTTCTGAAAT TGATCTTGAA TTACTGTGAT ACGACATGAT AGTCCCTCTC 
ACCCAGTGCA ATGACAATAA AGGCCTTGAA AAGTCTCACT TTTATTGAGA AAATAAAAAT 
CGTTCCACGG GACAGTCCCT CTTCTTTATA AAATGACCCT ATCCTTGAAA AGGAGGTGTG 
-TTAAGTTGTA ACCAGTACAC ACTTGAAATG ATGGTAAGTT CGCTTCGGTT CAGAATGTGT 
iDCTTTCTGAC AAATAAACAG AATAAAAAAA AAAAAAAAAA A 



3(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 149: 
* (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1140 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149: 



Scat can gtg tgg gcg gcg aaa gcc ggg ggc ttg aag aag gac tcg ctg 

His Gin Val Trp Ala Ala Lys Ala Gly Gly Leu Lys Lys Asp Ser Leu 
1 5 10 15 

CTC ACC GTG CGC CTG GGC GCC TGG GGC CAC CCC GCC TTC CCC TCC TGC 
Leu Thr Val Arg Leu Gly Ala Trp Gly His Pro Ala Phe Pro Ser Cys 
20 25 30 

GGG CGC CTC AAG GAG GAC AGC AGG TAC ATC TTC TTC ATG GAG CCC GAG 
Gly Arg Leu Lys Glu Asp Ser Arg Tyr lie Phe Phe Met Glu Pro Glu 
35 40 45 

GCC AAC AGC AGC GGC GGG CCC GGC CGC CTT CCG AGC CTC CTT CCC CCC 
Ala Asn Ser Ser Gly Gly Pro Gly Arg Leu Pro Ser Leu Leu Pro Pro 
50 55 60 
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TCT CGA GAC GGG CCG GAA CCT CAA GAA GGA GGT CAG CCG GGT GCT GTG 240 
Ser Arg Asp Gly Pro Glu Pro Gin Glu Gly Gly Gin Pro Gly Ala Val 
65 70 75 80 

CAA CGG TGC GCC TTG CCT CCC CGC TTG AAA GAG ATG AAG AGT CAG GAG 288 
Gin Arg Cys Ala Leu Pro Pro Arg Leu Lys Glu Met Lys Ser Gin Glu 
- 85 90 95 

TCT GTG GCA GGT TCC AAA CTA GTG CTT CGG TGC GAG ACC AGT TCT GAA 336 
Ser Val Ala Gly Ser Lys Leu Val Leu Arg Cys Glu Thr Ser Ser Glu 
100 105 110 

TAC TCC TCT CTC AAG TTC AAG TGG TTC AAG AAT GGG AGT GAA TTA AGC 384 
Tyr Ser Ser Leu Lys Phe Lys Trp Phe Lys Asn Gly Ser Glu Leu Ser 
115 120 125 

CGA AAG AAC AAA CCA GAA AAC ATC AAG ATA CAG AAA AGG CCG GGG AAG 432 
Arg Lys Asn Lys Pro Glu Asn lie Lys lie Gin Lys Arg Pro Gly Lys 
130 135 140 

TCA GAA CTT CGC ATT AGC AAA GCG TCA CTG GCT GAT TCT GGA GAA TAT 480 
Ser Glu Leu Arg lie Ser Lys Ala Ser Leu Ala Asp Ser Gly Glu Tyr 
145 150 155 160 

HATG TGC AAA GTG ATC AGC AAA CTA GGA AAT GAC AGT GCC TCT GCC AAC 528 
CiMet Cys Lys Val lie Ser Lys Leu Gly Asn Asp Ser Ala Ser Ala Asn 
j 165 170 175 

=ATC ACC ATT GTG GAG TCA AAC GCC ACA TCC ACA TCT ACA GCT GGG ACA 576 
"Ille Thr lie Val Glu Ser Asn Ala Thr Ser Thr Ser Thr Ala Gly Thr 
* n 180 185 190 

; AGC CAT CTT GTC AAG TGT GCA GAG AAG GAG AAA ACT TTC TGT GTG AAT 624 
ITSer His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn 
195 200 205 

tlGGA GGC GAG TGC TTC ATG GTG AAA GAC CTT TCA AAT CCC TCA AGA TAC 672 
UGly Gly Glu Cys Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr 
JS 210 215 220 

TTG TGC AAG TGC CAA CCT GGA TTC ACT GGA GCG AGA TGT ACT GAG AAT 720 
Leu Cys Lys Cys Gin Pro Gly Phe Thr Gly Ala Arg Cys Thr Glu Asn 
225 230 235 240 

GTG CCC ATG AAA GTC CAA ACC CAA GAA AAG TGC CCA AAT GAG TTT ACT 768 
Val Pro Met Lys Val Gin Thr Gin Glu Lys Cys Pro Asn Glu Phe Thr 

245 250 255 

GGT GAT CGC TGC CAA AAC TAC GTA ATG GCC AGC TTC TAC AGT ACG TCC 816 
Gly Asp Arg Cys Gin Asn Tyr Val Met Ala Ser Phe Tyr Ser Thr Ser 
260 265 270 
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ACT CCC TTT CTG TCT CTG CCT GAA TAGCGCATCT CAGTCGGTGC CGCTTTCTTG 870 
Thr Pro Phe Leu Ser Leu Pro Glu 

275 280 

TTGCCGCATC TCCCCTCAGA TTCCNCCTAG AGCTAGATGC GTTTTACCAG GTCTAACATT 930 

GACTGCCTCT GCCTGTCGCA TGAGAACATT AACACAAGCG ATTGTATGAC TTCCTCTGTC 990 

CGTGACTAGT GGGCTCTGAG CTACTCGTAG GTGCGTAAGG CTCCAGTGTT TCTGAAATTG 1050 

ATCTTGAATT ACTGTGATAC GACATGATAG TCCCTCTCAC CCAGTGCAAT GACAATAAAG 1110 

GCCTTGAAAA GTCAAAAAAA AAAAAAAAAA 1140 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 150: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1764 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150: 

G AAG TCA GAA CTT CGC ATT AGC AAA GCG TCA CTG GCT GAT TCT GGA GAA 49 
Lys Ser Glu Leu Arg He Ser Lys Ala Ser Leu Ala Asp Ser Gly Glu 
O l 5 10 15 

SlT AT ATG TGC AAA GTG ATC AGC AAA CTA GGA AAT GAC AGT GCC TCT GCC 97 
yJTyr Met Cys Lys Val He Ser Lys Leu Gly Asn Asp Ser Ala Ser Ala 

ffl 20 25 30 

MAAC ATC ACC ATT GTG GAG TCA AAC GCC ACA TCC ACA TCT ACA GCT GGG 145 
InAsn He Thr He Val Glu Ser Asn Ala Thr Ser Thr Ser Thr Ala Gly 
35 40 45 

E|ACA AGC CAT CTT GTC AAG TGT GCA GAG AAG GAG AAA ACT TTC TGT GTG 193 
STlThr Ser His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val 

LH 50 55 60 

S ; AAT GGA GGC GAC TGC TTC ATG GTG AAA GAC CTT TCA AAT CCC TCA AGA 241 
01 Asn Gly Gly Asp Cys Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg 
65 70 75 80 

TAC TTG TGC AAG TGC CAA CCT GGA TTC ACT GGA GCG AGA TGT ACT GAG 289 
Tyr Leu Cys Lys Cys Gin Pro Gly Phe Thr Gly Ala Arg Cys Thr Glu 

85 90 95 

AAT GTG CCC ATG AAA GTC CAA ACC CAA GAA AAA GCG GAG GAG CTC TAC 337 
Asn Val Pro Met Lys Val Gin Thr Gin Glu Lys Ala Glu Glu Leu Tyr 
100 105 HO 

CAG AAG AGA GTG CTC ACC ATT ACC GGC ATT TGC ATC GCG CTG CTC GTG 385 
Gin Lys Arg Val Leu Thr He Thr Gly He Cys He Ala Leu Leu Val 
115 120 125 
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GTT GGC ATC ATG TGT GTG GTG GTC TAC TGC AAA ACC AAG AAA CAA CGG 433 
Val Gly He Met Cys Val Val Val Tyr Cys Lys Thr Lys Lys Gin Arg 
130 135 140 

AAA AAG CTT CAT GAC CGG CTT CGG CAG AGC CTT CGG TCT GAA AGA AAC 481 
Lys Lys Leu His Asp Arg Leu Arg Gin Ser Leu Arg Ser Glu Arg Asn 
145 150 155 160 

ACC ATG ATG AAC GTA GCC AAC GGG CCC CAC CAC CCC AAT CCG CCC CCC 529 
Thr Met Met Asn Val Ala Asn Gly Pro His His Pro Asn Pro Pro Pro 

165 170 175 

GAG AAC GTG CAG CTG GTG AAT CAA TAC GTA TCT AAA AAT GTC ATC TCT 577 
Glu Asn Val Gin Leu Val Asn Gin Tyr Val Ser Lys Asn Val He Ser 
180 185 190 

AGC GAG CAT ATT GTT GAG AGA GAG GCG GAG AGC TCT TTT TCC ACC AGT 625 
Ser Glu His He Val Glu Arg Glu Ala Glu Ser Ser Phe Ser Thr Ser 
195 200 205 

CAC TAC ACT TCG ACA GCT CAT CAT TCC ACT ACT GTC ACT CAG ACT CCC 673 
His Tyr Thr Ser Thr Ala His His Ser Thr Thr Val Thr Gin Thr Pro 
210 215 220 

ilAGT CAC AGC TGG AGC AAT GGA CAC ACT GAA AGC ATC ATT TCG GAA AGC 721 
Iser His Ser Trp Ser Asn Gly His Thr Glu Ser He He Ser Glu Ser 
1^225 230 235 240 

"'CAC TCT GTC ATC GTG ATG TCA TCC GTA GAA AAC AGT AGG CAC AGC AGC 769 
7His Ser Val He Val Met Ser Ser Val Glu Asn Ser Arg His Ser Ser 
fi 245 250 255 

^CCG ACT GGG GGC CCG AGA GGA CGT CTC AAT GGC TTG GGA GGC CCT CGT 817 
3pro Thr Gly Gly Pro Arg Gly Arg Leu Asn Gly Leu Gly Gly Pro Arg 
U 260 265 270 

0GAA TGT AAC AGC TTC CTC AGG CAT GCC AGA GAA ACC CCT GAC TCC TAC 865 
fGlu Cys Asn Ser Phe Leu Arg His Ala Arg Glu Thr Pro Asp Ser Tyr 
275 280 285 

CGA GAC TCT CCT CAT AGT GAA AGA CAT AAC CTT ATA GCT GAG CTA AGG 913 
Arg Asp Ser Pro His Ser Glu Arg His Asn Leu He Ala Glu Leu Arg 
290 295 300 

AGA AAC AAG GCC CAC AGA TCC AAA TGC ATG CAG ATC CAG CTT TCC GCA 961 
Arg Asn Lys Ala His Arg Ser Lys Cys Met Gin He Gin Leu Ser Ala 
305 310 315 320 

ACT CAT CTT AGA GCT TCT TCC ATT CCC CAT TGG GCT TCA TTC TCT AAG 1009 
Thr His Leu Arg Ala Ser Ser He Pro His Trp Ala Ser Phe Ser Lys 

325 330 335 
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ACC CCT TGG CCT TTA GGA AGG TAT GTA TCA GCA ATG ACC ACC CCG GCT 1057 
Thr Pro Trp Pro Leu Gly Arg Tyr Val Ser Ala Met Thr Thr Pro Ala 
340 345 350 

CGT ATG TCA CCT GTA GAT TTC CAC ACG CCA AGC TCC CCC AAG TCA CCC 1105 
Arg Met Ser Pro Val Asp Phe His Thr Pro Ser Ser Pro Lys Ser Pro 
355 360 365 

CCT TCG GAA ATG TCC CCG CCC GTG TCC AGC ACG ACG GTC TCC ATG CCC 1153 
Pro Ser Glu Met Ser Pro Pro Val Ser Ser Thr Thr Val Ser Met Pro 
370 375 380 

TCC ATG GCG GTC AGT CCC TTC GTG GAA GAG GAG AGA CCC CTG CTC CTT 1201 
Ser Met Ala Val Ser Pro Phe Val Glu Glu Glu Arg Pro Leu Leu Leu 
385 390 395 400 

GTG ACG CCA CCA CGG CTG CGG GAG AAG TAT GAC CAC CAC GCC CAG CAA 1249 
Val Thr Pro Pro Arg Leu Arg Glu Lys Tyr Asp His His Ala Gin Gin 

405 410 415 

TTC AAC TCG TTC CAC TGC AAC CCC GCG CAT GAG AGC AAC AGC CTG CCC 1297 
Phe Asn Ser Phe His Cys Asn Pro Ala His Glu Ser Asn Ser Leu Pro 
M 420 425 430 

^CCC AGC CCC TTG AGG ATA GTG GAG GAT GAG GAA TAT GAA ACG ACC CAG 1345 
/""Pro Ser Pro Leu Arg lie Val Glu Asp Glu Glu Tyr Glu Thr Thr Gin 
WW 435 440 445 

OGAG TAC GAA CCA GCT CAA GAG CCG GTT AAG AAA CTC ACC AAC AGC AGC 1393 
U Glu Yyj. Glu Pro Ala Gln Glu Pro Va i Lys Ly S Leu Thr Asn Ser Ser 

.0 450 455 460 

HCGG CGG GCC AAA AGA ACC AAG CCC AAT GGT CAC ATT GCC CAC AGG TTG 1441 
OArg Arg Ala Lys Arg Thr Lys Pro Asn Gly His He Ala His Arg Leu 

=t|465 470 475 480 

'.ft GAA ATG GAC AAC AAC ACA GGC GCT GAC AGC AGT AAC TCA GAG AGC GAA 1489 
? Glu Met Asp Asn Asn Thr Gly Ala Asp Ser Ser Asn Ser Glu Ser Glu 

485 490 495 

ACA GAG GAT GAA AGA GTA GGA GAA GAT ACG CCT TTC CTG GCC ATA CAG 1537 
Thr Glu Asp Glu Arg Val Gly Glu Asp Thr Pro Phe Leu Ala He Gin 
500 505 510 

AAC CCC CTG GCA GCC AGT CTC GAG GCG GCC CCT GCC TTC CGC CTG GTC 1585 
Asn Pro Leu Ala Ala Ser Leu Glu Ala Ala Pro Ala Phe Arg Leu Val 
515 520 525 

GAC AGC AGG ACT AAC CCA ACA GGC GGC TTC TCT CCG CAG GAA GAA TTG 1633 
Asp Ser Arg Thr Asn Pro Thr Gly Gly Phe Ser Pro Gin Glu Glu Leu 
530 535 540 
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CAG GCC AGG CTC TCC GGT GTA ATC GCT AAC CAA GAC CCT ATC GCT GTC 1681 
Gin Ala Arg Leu Ser Gly Val lie Ala Asn Gin Asp Pro lie Ala Val 
545 550 555 560 

TAAAACCGAA ATACACCCAT AGATTCACCT GTAAAACTTT ATTTTATATA ATAAAGTATT 1741 
CCACCTTAAA TTAAACAAAA AAA I 764 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 151: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151: 



Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu Cys 
1 5 10 15 

Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr Leu Cys Lys Cys 
20 25 30 

ffipro Asn Glu Phe Thr Gly Asp Arg Cys Gin Asn Tyr Val Met Ala Ser 
Si 35 40 45 

CpPhe Tyr 
D 50 



y-(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 152: 

^ (i) SEQUENCE CHARACTERISTICS: 

J (A) LENGTH: 50 

I*-. (B) TYPE: amino acid 

\= (C) STRANDEDNESS: 

;f? (D) TOPOLOGY: linear 

tff (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152: 

Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu Cys 
15 10 15 

Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr Leu Cys Lys Cys 
20 25 30 

Gin Pro Gly Phe Thr Gly Ala Arg Cys Thr Glu Asn Val Pro Met Lys 
35 40 45 

Val Gin 
50 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 153: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153: 



Glu Cys Leu Arg Lys Tyr Lys Asp Phe Cys lie His Gly Glu Cys Lys 
1 5 10 15 

Tyr Val Lys Glu Leu Arg Ala Pro Ser Cys Lys Cys Gin Gin Glu Tyr 
20 25 30 

Phe Gly Glu Arg Cys Gly Glu Lys Ser Asn Lys Thr His Ser 
35 40 45 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 154: 

O (i) SEQUENCE CHARACTERISTICS: 

C2 (A) LENGTH: 198 

Ni (B) TYPE: nucleic acid 

m (C) STRANDEDNESS: single 

ill (D) TOPOLOGY: linear 

□ (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154: 

AGC CAT CTT GTC AAG TGT GCA GAG AAG GAG AAA ACT TTC TGT GTG AAT 
r.-Ser His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn 
£l 5 10 15 

IHgGA GGC GAG TGC TTC ATG GTG AAA GAC CTT TCA AAT CCC TCA AGA TAC 
^Gly Gly Glu Cys Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr 
*J 20 25 30 

TTG TGC AAG TGC CCA AAT GAG TTT ACT GGT GAT CGC TGC CAA AAC TAC 
Leu Cys Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin Asn Tyr 
35 40 45 

GTA ATG GCC AGC TTC TAC AGT ACG TCC ACT CCC TTT CTG TCT CTG CCT 
Val Met Ala Ser Phe Tyr Ser Thr Ser Thr Pro Phe Leu Ser Leu Pro 
50 55 60 

GAA TAG 

Glu 

65 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 155: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 192 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155: 



AGC CAT CTT GTC AAG TGT GCA GAG AAG GAG AAA ACT TTC TGT GTG AAT 
Ser His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn 
15 10 15 

GGA GGC GAG TGC TTC ATG GTG AAA GAC CTT TCA AAT CCC TCA AGA TAC 
Gly Gly Glu Cys Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr 

20 25 30 

TTG TGC AAG TGC CAA CCT GGA TTC ACT GGA GCG AGA TGT ACT GAG AAT 
Leu Cys Lys Cys Gin Pro Gly Phe Thr Gly Ala Arg Cys Thr Glu Asn 
35 40 45 

GTG CCC ATG AAA GTC CAA ACC CAA GAA AAA GCG GAG GAG CTC TAC TAA 
~Val Pro Met Lys Val Gin Thr Gin Glu Lys Ala Glu Glu Leu Tyr 
S 50 55 60 



=1(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 156: 

* (i) SEQUENCE CHARACTERISTICS: 

I (A) LENGTH: 183 

- (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
' (D) TOPOLOGY: linear 

=; (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156: 

3AGC CAT CTT GTC AAG TGT GCA GAG AAG GAG AAA ACT TTC TGT GTG AAT 
iSer His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn 
15 10 15 

GGA GGC GAG TGC TTC ATG GTG AAA GAC CTT TCA AAT CCC TCA AGA TAC 
Gly Gly Glu Cys Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr 
20 25 30 

TTG TGC AAG TGC CCA AAT GAG TTT ACT GGT GAT CGC TGC CAA AAC TAC 
Leu Cys Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin Asn Tyr 
35 40 45 

GTA ATG GCC AGC TTC TAC AAA GCG GAG GAG CTC TAC TAA 
Val Met Ala Ser Phe Tyr Lys Ala Glu Glu Leu Tyr 
50 55 60 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 157: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 210 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157: 



AGC CAT CTT GTC AAG TGT GCA GAG AAG GAG AAA ACT TTC TGT GTG AAT 48 
Ser His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn 
15 10 15 

GGA GGC GAG TGC TTC ATG GTG AAA GAC CTT TCA AAT CCC TCA AGA TAC 96 
Gly Gly Glu Cys Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr 
20 25 30 

TTG TGC AAG TGC CCA AAT GAG TTT ACT GGT GAT CGC TGC CAA AAC TAC 144 
Leu Cys Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin Asn Tyr 
35 40 45 

Q&TA ATG GCC AGC TTC TAC AAG CAT CTT GGG ATT GAA TTT ATG GAG AAA 192 
KVal Met Ala Ser Phe Tyr Lys His Leu Gly lie Glu Phe Met Glu Lys 
Sj 50 55 60 

;iGCG GAG GAG CTC TAC TAA 210 

FiAla Glu Glu Leu Tyr 

u65 



S(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 158: 

ir: (i) SEQUENCE CHARACTERISTICS: 

LH (A) LENGTH: 267 

% : (B) TYPE: nucleic acid 

* (C) STRANDEDNESS: single 

W (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158: 



AGC CAT CTT GTC AAG TGT GCA GAG AAG GAG AAA ACT TTC TGT GTG AAT 48 
Ser His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn 
15 10 15 

GGA GGC GAG TGC TTC ATG GTG AAA GAC CTT TCA AAT CCC TCA AGA TAC 96 
Gly Gly Glu Cys Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr 
20 25 30 

TTG TGC AAG TGC CAA CCT GGA TTC ACT GGA GCG AGA TGT ACT GAG AAT 144 
Leu Cys Lys Cys Gin Pro Gly Phe Thr Gly Ala Arg Cys Thr Glu Asn 
35 40 45 
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GTG CCC ATG AAA GTC CAA ACC CAA GAA AAG TGC CCA AAT GAG TTT ACT 192 
Val Pro Met Lys Val Gin Thr Gin Glu Lys Cys Pro Asn Glu Phe Thr 
50 55 60 

GGT GAT CGC TGC CAA AAC TAC GTA ATG GCC AGC TTC TAC AGT ACG TCC 240 
Gly Asp Arg Cys Gin Asn Tyr Val Met Ala Ser Phe Tyr Ser Thr Ser 
65 70 75 80 

ACT CCC TTT CTG TCT CTG CCT GAA TAG 267 
Thr Pro Phe Leu Ser Leu Pro Glu 

85 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 159: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 252 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159: 

QJAGC CAT CTT GTC AAG TGT GCA GAG AAG GAG AAA ACT TTC TGT GTG AAT 48 
MSer His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn 
Ldl 5 10 15 

SgGA GGC GAG TGC TTC ATG GTG AAA GAC CTT TCA AAT CCC TCA AGA TAC 96 
^Gly Gly Glu Cys Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr 

;I 20 25 30 

f.TTG TGC AAG TGC CAA CCT GGA TTC ACT GGA GCG AGA TGT ACT GAG AAT 144 
t^Leu Cys Lys Cys Gin Pro Gly Phe Thr Gly Ala Arg Cys Thr Glu Asn 

U 35 40 45 

WGTG CCC ATG AAA GTC CAA ACC CAA GAA AAG TGC CCA AAT GAG TTT ACT 192 
§Val Pro Met Lys Val Gin Thr Gin Glu Lys Cys Pro Asn Glu Phe Thr 

yt 50 55 60 

GGT GAT CGC TGC CAA AAC TAC GTA ATG GCC AGC TTC TAC AAA GCG GAG 240 
Gly Asp Arg Cys Gin Asn Tyr Val Met Ala Ser Phe Tyr Lys Ala Glu 
65 70 75 80 

GAG CTC TAC TAA 252 
Glu Leu Tyr 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 160: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 128 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160: 



CC ACA TCC ACA TCT ACA GCT GGG ACA AGC CAT CTT GTC AAG TGT GCA 
Thr Ser Thr Ser Thr Ala Gly Thr Ser His Leu Val Lys Cys Ala 
15 10 15 

GAG AAG GAG AAA ACT TTC TGT GTG AAT GGA GGC GAG TGC TTC ATG GTG 
Glu Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu Cys Phe Met Val 

20 25 30 

AAA GAC CTT TCA AAT CCC TCA AGA TAC TTG T GC 
Lys Asp Leu Ser Asn Pro Ser Arg Tyr Leu 
35 40 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 161: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 141 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161: 



'A CAT AAC CTT ATA GCT GAG CTA AGG AGA AAC AAG GCC CAC AGA TCC 
jl His Asn Leu lie Ala Glu Leu Arg Arg Asn Lys Ala His Arg Ser 
3 1 5 10 15 

lAAA TGC ATG CAG ATC CAG CTT TCC GCA ACT CAT CTT AGA GCT TCT TCC 
?Lys Cys Met Gin lie Gin Leu Ser Ala Thr His Leu Arg Ala Ser Ser 
; 20 25 30 

ATT CCC CAT TGG GCT TCA TTC TCT AAG ACC CCT TGG CCT TTA GGA AG 
lie Pro His Trp Ala Ser Phe Ser Lys Thr Pro Trp Pro Leu Gly Arg 
35 40 45 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 162: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in positions 15 and 22 is unknown, 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162: 



Ala Ala Glu Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu Xaa Phe 
15 10 15 

Met Val Lys Asp Leu Xaa Asn Pro 
20 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 163: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 745 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163: 



Hatg AGA TGG cga CGC gcc ccg cgc CGC TCC GGG CGT CCC GGC CCC CGG 48 
iCMet Arg Trp Arg Arg Ala Pro Arg Arg Ser Gly Arg Pro Gly Pro Arg 
= 1 5 10 15 

blGCC CAG CGC CCC GGC TCC GCC GCC CGC TCG TCG CCG CCG CTG CCG CTG 96 
PBAla Gin Arg Pro Gly Ser Ala Ala Arg Ser Ser Pro Pro Leu Pro Leu 

m 20 25 30 

mCTG CCA CTA CTG CTG CTG CTG GGG ACC GCG GCC CTG GCG CCG GGG GCG 144 
"Leu Pro Leu Leu Leu Leu Leu Gly Thr Ala Ala Leu Ala Pro Gly Ala 
35 40 45 

GCG GCC GGC AAC GAG GCG GCT CCC GCG GGG GCC TCG GTG TGC TAC TCG 192 
Ala Ala Gly Asn Glu Ala Ala Pro Ala Gly Ala Ser Val Cys Tyr Ser 
50 55 60 

TCC CCG CCC AGC GTG GGA TCG GTG CAG GAG CTA GCT CAG CGC GCC GCG 240 
Ser Pro Pro Ser Val Gly Ser Val Gin Glu Leu Ala Gin Arg Ala Ala 
65 70 75 80 

GTG GTG ATC GAG GGA AAG GTG CAC CCG CAG CGG CGG CAG CAG GGG GCA 288 
Val Val lie Glu Gly Lys Val His Pro Gin Arg Arg Gin Gin Gly Ala 

85 90 95 
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CTC GAC AGG AAG GCG GCG GCG GCG GCG GGC GAG GCA GGG GCG TGG GGC 336 
Leu Asp Arg Lys Ala Ala Ala Ala Ala Gly Glu Ala Gly Ala Trp Gly 
100 ? 105 110 

i 

GGC GAT CGC GAG CCG CCA GCC GCG GGC CCA CGG GCG CTG GGG CCG CCC 384 

Gly Asp Arg Glu Pro Pro Ala Ala Gly Pro Arg Ala Leu Gly Pro Pro 

115 120 125 

GCC GAG GAG CCG CTG CTC GCC GCC AAC GGG ACC GTG CCC TCT TGG CCC 432 
Ala Glu Glu Pro Leu Leu Ala Ala Asn Gly Thr Val Pro Ser Trp Pro 
130 135 140 

ACC GCC CCG GTG CCC AGC GCC GGC GAG CCC GGG GAG GAG GCG CCC TAT 480 
Thr Ala Pro Val Pro Ser Ala Gly Glu Pro Gly Glu Glu Ala Pro Tyr 
145 150 155 160 

CTG GTG AAG GTG CAC CAG GTG TGG GCG GTG AAA GCC GGG GGC TTG AAG 528 
Leu Val Lys Val His Gin Val Trp Ala Val Lys Ala Gly Gly Leu Lys 

165 170 175 

AAG GAC TCG CTG CTC ACC GTG CGC CTG GGG ACC TGG GGC CAC CCC GCC 576 
Lys Asp Ser Leu Leu Thr Val Arg Leu Gly Thr Trp Gly His Pro Ala 
180 185 190 



STTC CCC TCC TGC GGG AGG CTC AAG GAG GAC AGC AGG TAC ATC TTC TTC 624 
^Phe Pro Ser Cys Gly Arg Leu Lys Glu Asp Ser Arg Tyr lie Phe Phe 
l] 195 200 205 

2ATG GAG CCC GAC GCC AAC AGC ACC AGC CGC GCG CCG GCC GCC TTC CGA 672 
^Met Glu Pro Asp Ala Asn Ser Thr Ser Arg Ala Pro Ala Ala Phe Arg 
J 210 215 220 

!i GCC TCT TTC CCC CCT CTG GAG ACG GGC CGG AAC CTC AAG AAG GAG GTC 720 
^Ala Ser Phe Pro Pro Leu Glu Thr Gly Arg Asn Leu Lys Lys Glu Val 
J 225 230 235 240 

ill AGC CGG GTG CTG TGC AAG CGG TGC G 745 
tflSer Arg Val Leu Cys Lys Arg Cys 

m 245 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 164: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ix) FEATURE : 

(D) OTHER INFORMATION: Xaa in position 1 is unknown, 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164: 

Xaa Ala Leu Ala Ala Ala Gly Tyr Asp Val Glu Lys 
15 10 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 165: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
5 (ix) FEATURE: 

^ (D) OTHER INFORMATION: Xaa in position 1 is unknown, 

* (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165: 



^Xaa Leu Val Leu Arg 

* 1 5 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 166: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in positions 1/ 2, and 3 is unknown, 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166: 

Xaa Xaa Xaa Tyr Pro Gly Gin He Thr Ser Asn 
15 10 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 167: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: N in positions 25 and 36 is unknown. 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167: 
ATAGGGAAGG GCGGGGGAAG GGTCNCCCTC NGCAGGGCCG GGCTTGCCTC TGGAGCCTCT 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 168: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: N in position 16 is unknown, 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168: 



""TTTACACATA TATTCNCC 18 



l H(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 169: 
'J (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: iinear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 169: 



Glu Thr Gin Pro Asp Pro Gly Gin lie Leu Lys Lys Val Pro Met Val 
1 5 10 15 

lie Gly Ala Tyr Thr 
20 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 170: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 422 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170: 



Met Arg Trp Arg Arg Ala Pro Arg Arg Ser Gly Arg Pro Gly 
15 10 

Ala Gin Arg Pro Gly Ser Ala Ala Arg Ser Ser Pro Pro Leu 
20 25 30 

Leu Pro Leu Leu Leu Leu Le"u Gly Thr Ala Ala Leu Ala Pro 
35 40 45 

Ala Ala Gly Asn Glu Ala Ala Pro Ala Gly Ala Ser Val Cys 
50 55 60 

Ser Pro Pro Ser Val Gly Ser Val Gin Glu Leu Ala Gin Arg 

=3 65 70 75 

flval Val lie Glu Gly Lys Val His Pro Gin Arg Arg Gin Gin 
1 85 90 



!?Leu Asp Arg Lys Ala Ala Ala Ala Ala Gly Glu Ala Gly Ala 
^ 100 105 110 

BGly Asp Arg Glu Pro Pro Ala Ala Gly Pro Arg Ala Leu Gly 
115 120 125 

3Ala Glu Glu Pro Leu Leu Ala Ala Asn Gly Thr Val Pro Ser 

U 130 135 140 

PThr Ala Pro Val Pro Ser Ala Gly Glu Pro Gly Glu Glu Ala 
pl45 150 155 

Leu Val Lys Val His Gin Val Trp Ala Val Lys Ala Gly Gly 

165 170 

Lys Asp Ser Leu Leu Thr Val Arg Leu Gly Thr Trp Gly His 
180 185 190 

Phe Pro Ser Cys Gly Arg Leu Lys Glu Asp Ser Arg Tyr lie 
195 200 205 

Met Glu Pro Asp Ala Asn Ser Thr Ser Arg Ala Pro Ala Ala 
210 215 220 



Pro Arg 
15 

Pro Leu 
Gly Ala 

Tyr Ser 

Ala Ala 
80 

Gly Ala 
95 

Trp Gly 

Pro Pro 

Trp Pro 

Pro Tyr 
160 

Leu Lys 
175 

Pro Ala 
Phe Phe 
Phe Arg 
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Ala Ser Phe Pro Pro Leu Glu Thr Gly Arg Asn Leu Lys Lys Glu Val 
225 230 235 240 

Ser Arg Val Leu Cys Lys Arg Cys Ala Leu Pro Pro Gin Leu Lys Glu 

245 250 255 

Met Lys Ser Gin Glu Ser Ala Ala Gly Ser Lys Leu Val Leu Arg Cys 
260 265 270 

Glu Thr Ser Ser Glu Tyr Ser Ser Leu Arg Phe Lys Trp Phe Lys Asn 
275 280 285 

Gly Asn Glu Leu Asn Arg Lys Asn Lys Pro Gin Asn lie Lys lie Gin 
290 295 300 

Lys Lys Pro Gly Lys Ser Glu Leu Arg lie Asn Lys Ala Ser Leu Ala 
305 310 315 320 

Asp Ser Gly Glu Tyr Met Cys Lys Val lie Ser Lys Leu Gly Asn Asp 

325 330 335 

Ser Ala Ser Ala Asn lie Thr lie Val Glu Ser Asn Ala Thr Ser Thr 
340 345 350 



Si Ser Thr Thr Gly Thr Ser His Leu Val Lys Cys Ala Glu Lys Glu Lys 
^ 355 360 365 

J Thr Phe Cys Val Asn Gly Gly Glu Cys Phe Met Val Lys Asp Leu Ser 
m 370 375 380 

H'Asn Pro Ser Arg Tyr Leu Cys Lys Cys Pro Asn Glu Phe Thr Gly Asp 
£ 385 390 395 400 

UArg Cys Gin Asn Tyr Val Met Ala Ser Phe Tyr Ser Thr Ser Thr Pro 
Q 405 410 415 

hi Phe Leu Ser Leu Pro Glu 

IJj 420 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 171: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 

(B) TYPE: amino. acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171: 

Met Ser Glu Arg Lys Glu Gly Arg Gly Lys Gly Lys Gly Lys Lys Lys 
1 5 10 15 
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Glu Arg Gly Ser Gly Lys Lys Pro 
20 

Pro Arg Glu lie lie Thr Gly Met 
35 40 

Val Ser Ser Glu Ser Pro lie Arg 
50 55 

Asn Thr Ser Ser Ser 
65 



Glu Ser Ala Ala Gly Ser Gin Ser 
25 30 

Pro Ala Ser Thr Glu Gly Ala Tyr 

45 

lie Ser Val Ser Thr Glu Gly Ala 
60 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 172: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172: 



**Arg Lys Gly Asp Val Pro Gly Pro Arg Val Lys Ser Ser Arg Ser Thr 
^ 1 5 10 15 

JiThr Thr Ala 



:s i.(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 173: 

-i (i) SEQUENCE CHARACTERISTICS: 

i] (A) LENGTH: 231 

=n (B) TYPE: nucleic acid 

« (C) STRANDEDNESS: single 

2 (D) TOPOLOGY: linear 

=M (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173: 



CGCGAGCGCC TCAGCGCGGC CGCTCGCTCT CCCCCTCGAG GGACAAACTT TTCCCAAACC 60 

CGATCCGAGC CCTTGGACCA AACTCGCCTG CGCCGAGAGC CGTCCGCGTA GAGCGCTCCG 120 

TCTCCGGCGA GATGTCCGAG CGCAAAGAAG GCAGAGGCAA AGGGAAGGGC AAGAAGAAGG 180 

AGCGAGGCTC CGGCAAGAAG CCGGAGTCCG CGGCGGGCAG CCAGAGCCCA G 231 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 174: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 178 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174: 



CCTTGCCTCC CCGATTGAAA GAGATGAAAA GCCAGGAATC GGCTGCAGGT TCCAAACTAG 
TCCTTCGGTG TGAAACCAGT TCTGAATACT CCTCTCTCAG ATTCAAGTGG TTCAAGAATG 
GGAATGAATT GAATCGAAAA AACAAACCAC AAAATATCAA GATACAAAAA AAGCCAGG 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 175: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 122 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

i (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175: 

!f GAAGTCAGAA CTTCGCATTA ACAAAGCATC ACTGGCTGAT TCTGGAGAGT ATATGTGCAA 60 
rAGTGATCAGC AAATTAGGAA ATGACAGTGC CTCTGCCAAT ATCACCATCG TGGAATCAAA 120 

2cg 122 



n (2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 176: 
i»5 (i) SEQUENCE CHARACTERISTICS: 

hi (A) LENGTH: 102 

(B) TYPE: nucleic acid 
3£ (C) STRANDEDNESS: single 

y; (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176: 



AGATCATCAC TGGTATGCCA GCCTCAACTG AAGGAGCATA TGTGTCTTCA GAGTCTCCCA 
TTAGAATATC AGTATCCACA GAAGGAGCAA ATACTTCTTC AT 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 177: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 128 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 177: 



CTACATCTAC ATCCACCACT GGGACAAGCC ATCTTGTAAA ATGTGCGGAG AAGGAGAAAA 
CTTTCTGTGT GAATGGAGGG GAGTGCTTCA TGGTGAAAGA CCTTTCAAAC CCCTCGAGAT 
ACTTGTGC 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 178: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 178: 



IAAGTGCCAAC CTGGATTCAC TGGAGCAAGA TGTACTGAGA ATGTGCCCAT GAAAGTCCAA 
AACCAAGAA 



=,(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 179: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 179: 

'TCGGGCTCCA TGAAGAAGAT GTA 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 180 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180: 

TCCATGAAGA AGATGTACCT GCT 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181: 

ATGTACCTGC TGTCCTCCTT GA 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 182: 

TTGAAGAAGG ACTCGCTGCT CA 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 183: 

AAAGCCGGGG GCTTGAAGAA 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 184: 

ATGARGTGTG GGCGGCGAAA 
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What is claimed is: 



1 1. A DNA sequence encoding a polypeptide of the 

2 formula 

3 WYBAZCX 

4 wherein WYBAZCX is composed of the polypeptide 



5 segments shown in Figure 31 (SEQ ID Nos. 136-139, 141-147, 

6 160, 161, and 163) * wherein W comprises polypeptide segment 

7 F, or is absent; wherein Y comprises polypeptide segment E, 

8 or is absent? wherein Z comprises polypeptide segment G or 

9 is absent; and wherein X comprises polypeptide segments C/D 

10 HKL, C/D H, C/D HL, C/D D, C/D' HL, C/D' HKL, C/D' H, C/D' 

11 D r C/D C/D' HKL, C/D C/D' H, C/D C/D' HL, C/D C/D' D, C/D D' 

12 H, C/D D' HL, C/D D' HKL, C/D' D' H, C/D' D' HKL, C/D C/D' 

13 D' H, C/D C/D' D' HL, C/D C/D' D' HKL, or C/D' D' HL; 

14 provided that, either 



15 a) at least one of F, Y, B, A, Z, C, or X is of 

16 bovine origin; or 

17 b) Y comprises polypeptide segment E; or 

18 c) X comprises polypeptide segments C/D HKL, C/D D, 

19 C/D' HKL, C/D C/D' HKL, C/D C/D' D, C/D D' H, C/D D' HL, C/D 

20 D' HKL, C/D' D' H, C/D' D' HKL, C/D C/D' D' H, C/D C/D' D' 

21 HL, C/D C/D' D' HKL, C/D'H, C/D C/D'H, or C/D C/D'HL. 

1 2. The DNA sequence of claim 1, wherein X 

2 comprises polypeptide segments C/D HKL having the amino acid 

3 sequences shown in Figure 31 (SEQ ID Nos. 136-139, 141-142, 

4 146, 147, 160, 161). 
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1 3. The DNA sequence of claim 1, wherein X 

2 comprises polypeptide segments C/D' E having the amino acid 

3 sequences shown in Figure 31 (SSQ ID Nos. 136-139, 141, 143, 

4 146, 160). 

1 4. The DNA sequence of claim 1, vberein X 

2 comprises polypeptide segments C/D D having the amino acid 

3 sequences shown in Figure 31 (SEQ ZD Nos. 136-139, 141, 142, 

4 144, 160). 

1 5. The DNA sequence of claim 1, wherein X 

2 comprises polypeptide segments C/D' HKL having the amino 

3 acid sequences shown in Figure 31 (SEQ ID Nos. 136-139, 141, 

4 143, 146, 147, 160, 161). 

1 6. The DNA sequence of claim 1, wherein X 

2 comprises polypeptide segments C/D C/D' KKL having the amino 

3 acid sequences shown in Figure 31 (SEQ ID Nos. 136-139, 141- 

4 143, 146, 147, 160, 161). 

1 7. The DNA sequence of claim 1, wherein X 

2 comprises polypeptide segments C/D C/D' H having the amino 

3 acid sequences shown in Figure 31 (SEQ ID Nos. 136-139, 141- 

4 143, 146, 160). 

1 8. The DNA sequence of claim 1, wherein X 

2 comprises polypeptide segments C/D C/D' EL having the amino 

3 acid sequences shown in Figure 31 (SEQ ID Nos. 136-139, 141- 

4 143, 146, 147, 160). 
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1 9. The DKA sequence of claim 1, vherein X 

2 comprises polypeptide segments C/D C/D' D having the amino 

3 acid sequences shown in Figure 31 (SEQ ZD Kos. 136-139, 141- 

4 144, 160}. 

1 10. The DKA sequence of claim 1, vherein X 

2 comprises polypeptide segments C/D D'H having the amino acid 

3 sequences shown in Figure 31 (SEQ ZD Kos. 136-139, 141-142, 

4 145, 146, 160). 

1 11. The DKA sequence of claim 1, vherein X 

2 comprises polypeptide segments C/D D'H L having the amino 

3 acid sequences shown in Figure 31 (SEQ ZD Kos. 136-139, 141- 

4 142, 145, 146, 147, 160). 

1 12. The DKA sequence of claim 1, vherein X 

2 comprises polypeptide segments C/D D'H X L having the amino 

3 acid sequences shown in Figure 31 (SEQ ZD Nos. 136-139, 141- 

4 142, 145-147, 160, 161). 

1 13. The DKA sequence of claim 1, vherein X 

2 comprises polypeptide segments C/D' D' H having the amino 

3 acid sequences shown in Figure 31 (SEQ ZD Nos. 136-139, 141, 

4 143, 145, 146, 160). 

1 14. The DKA sequence of claim 1, vherein X 

2 comprises polypeptide segments C/D' D' H X L having the 

3 amino acid sequences shown in Figure 31 (SEQ ZD Nos. 136- 

4 139, 141, 143, 145-147, 160, 161). 
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1 15. The DKA sequence of claim 1, vherein X 

2 comprises polypeptide segments C/D C/D' D' K having the 

3 amino acid sequences shown in Figure 31 (SEQ ZD Nos. 136- 

4 139, 141-143, 145, 146, 160). 

1 16. The DKA sequence of claim 1, vherein X 

2 comprises polypeptide segments C/D C/D' D' H L having the 

3 amino acid sequences shown in Figure 31 (SEQ ZD Nos. 136- 

4 139, 141-143, 145-147, 160). 

1 17. The DKA sequence of claim 1, vherein X 

2 comprises polypeptide segments C/D C/D' D' H X L having the 

3 amino acid sequences shown in Figure 31 (SEQ ZD Nos. 136- 

4 139, 141-143, 145-147, 160, 161). 

1 18. The DKA sequence comprising coding segments 

2 S 'FBA' 3 ' coding for polypeptide segments having the amino 

3 acid sequences shown in Figure 31 (SEQ ZD Nos. 136, 138, 

4 139). 



1 19. The DKA sequence comprising coding segments 

2 $ 'F£A' 3 ' coding for polypeptide segments having the amino 

3 acid sequences shown in Figure 31 (SEQ ZD Nos. 136, 138, 

4 140). 

1 20. The DKA sequence comprising coding segments 

2 S 'F£BA 3 ' coding for polypeptide segments having the amino 

3 acid sequences shown in Figure 31 (SEQ ID Nos. 136-139, 

4 163). 
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1 21. The DNA sequence comprising coding segments 

2 5 'F£BA' 3 ' coding for polypeptide segments having the amino 

3 acid sequences shown in Figure 31 (SEC! ID Nos. 136-138, 140, 

4 163). 

1 22. Purified DNA encoding GGF2KBS5. 

1 23. A polypeptide of the formula 

2 WYBAZCX 

3 wherein KYBAZCX is composed of the polypeptide 



4 segments shown in Figure 31 (SEQ ID Nos. 136-139, 141-147, 

5 160, 161, 163} ; wherein W comprises polypeptide segment F, 

6 or is absent; wherein Y comprises polypeptide segment E, or 

7 is absent; wherein Z comprises polypeptide segment G or is 

8 absent; and wherein X comprises peptide segments C/D HKL, 

9 C/D H, C/D HL, C/D D, C/D' EL, C/D' HKL, C/D' H, C/D' D, C/D 

10 C/D' HKL, C/D C/D' H, C/D C/D' HL, C/D C/D' D, C/D D' H, C/D 

11 D' HL, C/D D' HKL, C/D' D' H, C/D' D' HKL, C/D C/D' D' H, 

12 C/D C/D' D' HL, C/D C/D' D' HKL, or C/D' D' HL; provided 

13 that, either 

14 a) at least one of F, Y, b, A, Z, C, or X is of 

15 bovine origin; or 

16 b) Y comprises polypeptide segment E; or 

17 C) X comprises polypeptide segments C/D HKL, C/D' 

18 HKL, C/D D, C/D C/D' HKL, C/D C/D' D, C/D D' H, C/D D' HL, 

19 C/D D' HKL, C/D' D' H, C/D' D' HKL, C/D C/D' D' H, C/D C/D' 

20 D' HL, C/D C/D' D' HKL, C/D'H, C/D C/D'H, or C/D C/D'HL. 



1 24. A polypeptide of elaim 23, vherein X comprises 

2 C/D KKL polypeptide segments having the amino acid sequences 

3 shown in Figure 31 (SEQ ID Nos. 136-139, 141-142, 146, 147, 

4 160, 161). 

1 25. A polypeptide of claim 23, wherein X comprises 

2 C/D D polypeptide segments having the amino acid sequences 

3 shown in Figure 31 (SEQ ID Nos. 136-139, 141, 142, 144, 

4 160). 

1 26. A polypeptide of claim 23, wherein X comprises 

2 C/D' H polypeptide segments having the amino acid sequences 

3 shown in Figure 31 (SEQ ID Nos. 136-139, 141, 143, 146, 

4 160). 

1 27. A polypeptide of claim 23, wherein X comprises 

2 C/D' HKL polypeptide segments having the amino acid 

3 sequences shown in Figure 31 (SEQ ID Nos. 136-139, 141, 143, 

4 146, 147, 160, 161). 

1 28. A polypeptide of claim 23, wherein X comprises 

2 C/D C/D' HKL polypeptide segments having the amino acid 

3 sequences shown in Figure 31 (SEQ ID Nos. 136-139, 141-143, 

4 146, 147, 160, 161). 

1 29. A polypeptide of claim 23, wherein X comprises 

2 C/D C/D' H polypeptide segments having the amino acid 

3 sequences shown in Figure 31 (SEQ ID Nos. 136-139, 141-143, 

4 146, 160). 
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1 30. A polypeptide of claim 23, wherein X comprises 

2 c/D C/D' EL polypeptide segments having the amino acid 

3 sequences shown in Figure 31 (SEQ ID Kos. 136-139, 141- 

4 143,146, 147, 160). 

1 31. A polypeptide of claim 23, wherein X comprises 

2 C/D C/D' D, polypeptide segments having the amino acid 

3 sequences shown in Figure 31 (SEQ ID Kos. 136-139, 141-144, 

4 160). 

1 32. A polypeptide of claim 23, wherein X comprises 

2 C/D D'H polypeptide segments having the amino acid 

3 sequences shown in Figure 31 (SEQ ID Hos. 136-139, 141, 142, 

4 145, 146, 160). 

1 33. A polypeptide of claim 23, wherein X comprises 

2 C/D D'H L polypeptide segments having the amino acid 

3 sequences shown in Figure 31 (SEQ ID Hos. 136-139, 141, 142, 

4 145-147, 160). 

1 34. A polypeptide of claim 23, wherein X comprises 

2 C/D D'H K L polypeptide segments having the amino acid 

3 sequences shown in Figure 31 (SEQ ID Kos. 136-139, 141, 142, 

4 145-147, 160, 161). 

1 35. a polypeptide of claim 23, wherein X comprises 

2 C/D' D' H polypeptide segments having the amino acid 

3 sequences shown in Figure 31 (SEQ ID Hos. 136-139, 141, 143, 

4 145, 146, 160). 
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1 36. A polypeptide of claim 23, wherein X comprises 

2 C/D' D' H K L polypeptide segments having the amino acid * 

3 sequences shown in Figure 31 (SEQ ID Hos. 136-139, 141, 143, 

4 145-147, 160, 161). 

1 37. A polypeptide of claim 23, wherein X comprises 

2 C/D C/D' D' B polypeptide segments having the amino acid 

3 sequences shown in Figure 31 (SEQ ID Hos. 136-139, 141-143, 

4 145, 146, 160). 

1 38. A polypeptide of claim 23, wherein X comprises 

2 C/D C/D' D' H L polypeptide segments having the amino acid 

3 sequences shown in Figure 31 (SEQ ID Nos. 136-139, 141-143, 

4 145-147, 160). 



1 39. A polypeptide of claim 23, wherein X comprises 

2 C/D C/D' D' H X L polypeptide segments having the amino acid 

3 sequences shown in Figure 31 (SEQ ID Nos. 136-139, 141-143, 

4 145-147, 160, 161). 



1 40. A polypeptide comprising FBA polypeptide 

2 segments having the amino acid sequences shown in Figure 31 

3 (SEQ ZD NOS. 136, 138, 139). 

1 41. A polypeptide coaprising FEBA polypeptide 

2 segments having the amino acid sequences shown in Figure 31 

3 (SEQ ID Nos. 136-139, 163). 



1 42. A polypeptide coaprising FBA' polypeptide 

2 segments having the amino acid sequences shown in Figure 31 

3 (SEQ ID NOS. 136, 139, 140). 
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1 43. A polypeptide comprising FEBA' polypeptide 

2 segments having the amino acid sequences 'shown in Figure 31 

3 (SEQ ID Hos. 136-139, 140, 163). 

1 44. Purified G6F2KBS5 polypeptide. 

1 45. A basic polypeptide factor having aitogenic 

2 activity stimulating the division of Schwann cells in the 

3 presence of fetal calf plasma, said polypeptide having a 

4 molecular weight of from about 30 kD to about 36 kD, said 

5 polypeptide including within its amino acid sequence any one 
€ or more of the following polypeptide sequences: 



7 FKCDAHTE 

8 ASLADEYEYKXK 

9 TETSSSGLXLK 

10 ASLADEYEYKRK 

11 AGYFAEXAK 

12 TTEKASEQGA 

13 AKEALAALK 

14 FVLQAKK 

15 ETQFDFGQILKKVPMV2GAYT 

16 EYK CLKFKWFKKATVM 

17 EXKFYVP 

18 KLEFLXAK 



1 46. A basic polypeptide factor having mitogenic 

2 activity stimulating the division of Schwann cells in the 

3 presence of fetal calf plasma, said polypeptide having a 

4 molecular weight of from about 55 kD to about 63 JcD, and 

5 said polypeptide including within its amino acid sequence 

6 any one or more of the following peptide sequences: 
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7 VHQVWAAK 

6 YIFFHEFEAX.SSG 

$ L C A K G P P A F P V H 

10 WFVVJE6K 

11 ASPVE VCEVQEIVQR 

12 VCILTVAAIPPT 

13 KVHQVWAAK 

14 KASLADSGEYHX* 

15 DLLLXV 

16 EGXVEFQH6AIDH 

17 PS CGRUEDSRnFFKE 
IS ELNRKHKPQNIKIQKK 

1 47. A method for stimulating mitogenesis of & glial 

2 cell, eaid method comprising contacting said glial cell with 

3 a polypeptide defined by the formula 

4 WYBAZCX 

5 wherein KYBAZCX is composed of the polypeptide 



6 segments shown in Figure 31 (SEQ ID Hos. 136-139, 141-147, 

7 160, 161, 163); wherein K comprises polypeptide segment T, 

8 or is absent; wherein Y comprises polypeptide segment £, or 

9 is absent; wherein 2 comprises polypeptide segment C or is 

10 absent; and wherein X comprises polypeptide segments C/D 

11 HKL, C/D H, C/D HL, C/D D, C/D' KL, C/D' HKL, C/D' B, C/D' 

12 D, C/D C/D' HKL, C/D C/D' H, C/D C/D' HL, C/D C/D r D, C/D D' 

13 H, C/D D' KL, C/D D' KKL, C/D' D' K, C/D' D' HL, C/D' D' 

14 HKL, C/D C/D' D' H, C/D C/D' D' HL, or C/D C/D' D' HKL. 
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1 48,. A method for stimulating mitogenesis of a glial 

2 cell, »aid method comprising contacting said glial cell vith 

3 a polypeptide comprising FBA polypeptide segments having the 

4 amino acid sequences shovn in Figure 31 (SEQ ZD Nos. 136, 

5 138, 139). 

1 49. A method of stimulating mitogenesis of a glial 

2 cell, said method comprising contacting said glial cell vith 

3 a polypeptide comprising FBA' polypeptide segments having 

4 the amino acid sequences shovn in Figure 31 (SEQ ZD Nos. 

5 136, 138, 140). 

1 50. A method of stimulating mitogenesis of a glial 

2 cell, said method comprising contacting said glial cell vith 

3 a polypeptide comprising FEBA polypeptide segments having 

4 the amino acid sequences shovn in Figure 31 (SEQ ZD Nos. 

5 .136-139, 163). 

1 51. A method of stimulating mitogenesis of a glial 

2 cell, said method comprising contacting said glial cell vith 

3 a polypeptide comprising FEBA' polypeptide segments having 

4 the amino acid sequences corresponding to polypeptide 

5 segments shovn in Figure 31 (SEQ ZD Nos. 136-138, 140, 163) 

6 to glial cells* 



1 52* A method of stimulating mitogenesis of a glial 

2 cell, said method comprising contacting said glial cell vith 

3 GGF2HBS5 polypeptide. 

1 53* A method of stimulating mitogenesis of a glial 

2 cell said method comprising contacting said glial cell vith 

3 a compound which specifically binds the plB5* rbB2 receptor 

4 of glial cells. 
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1 54. A method of stimulating mitogenesis of a glial 

2 cell, said method comprising contacting said glial cell vith 

3 a polypeptide, comprising EGFLl, having the amino acid 

4 sequence shown Fig. 38, Seq. ZD No. 154. 

1 55. A method of stimulating mitogenesis of a glial 

2 cell, said method comprising contacting said glial cell vith 

3 a polypeptide, comprising EGFL2, having the amino acid 

4 sequence shown in Figure 39, Seg. ID Ho. 155. 

1 56. A method of stimulating mitogenesis of a glial 

2 cell, said method comprising contacting said glial cell with 

3 a polypeptide, comprising EGFL 3, vith the amino acid 

4 sequence shown in Fig. 40, Seq. ID Ho. 156. 

1 57. A method of stimulating mitogenesis of a glial 

2 cell, said method comprising contacting said glial cell vith 

3 a polypeptide, comprising EGFL4, vith the amino acid 

4 sequence shown in Fig. 41, Seq. ID Ho. 157. 

1 58. A method of stimulating mitogenesis of a glial 

2 cell, said method comprising contacting said glial cell vith 

3 a polypeptide, comprising EGFL5, vith the amino acid 

4 sequence shown in Fig. 42, Seq. ID Ho. 158, to glial cells. 

1 59. A method of stimulating mitogenesis of a glial 

2 cell, eaid method comprising contacting said glial cell vith 

3 a polypeptide, comprising EGFL6, vith the amino acid 

4 sequence shown Fig. 43, Seq. ID Ho. 159. 

1 60. A method for the prophylaxis or treatment of a 

2 pathophysiological condition of the nervous system in a 

3 mammal in which said condition involves a cell type which is 
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4 sensitive or responsive to a polypeptide as defined in any 

5 one of claims 1 and 18-22, said method comprising 

6 administering to said mammal an affective amount of said 

7 polypeptide. 



1 61. A method as claimed in claim 60, wherein said 

2 condition Involves peripheral nerve damage. 

1 62. The method as claimed in claim 60, vherein said 

2 condition involves glia of the central nervous system. 

1 63. A method of stimulating mitogenic activity in a 

2 glial cell, said method comprising applying 35 kD 

3 polypeptide factor isolated from the rat I-EJ transformed 

4 fibroblast cell line to said glial cell. 

1 64. A method of stimulating mitogenic activity in a 

2 glial cell, said method comprising applying 75 W> 

3 polypeptide factor isolated from the SKBR-3 human breast 

4 cell line to said glial cell. 

1 65. A method of stimulating mitogenic activity in a 

2 glial cell, said method comprising applying 44 IcD 

3 polypeptide factor isolated from the rat 2-SJ transformed 

4 fibroblast cell line to said glial cell. 

1 66. A method of stimulating mitogenic activity in a 

2 glial cell, said method comprising applying 45 JcD 

3 polypeptide factor isolated from the KDA - KB 231 human 

4 breast cell line to said glial cell. 
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a 67. A method of stimulating mitogenic activity in a 

2 glial cell, said aethod comprising applying 7 to 14 IcD 

3 polypeptide factor isolated from the ATL-2 human T-eell line 

4 to said glial cell. 

1 68. A aethod of stimulating aitogenic activity in a 

2 glial cell, said aethod comprising applying 25 YD 

3 polypeptide factor isolated from activated aouse peritoneal 

4 macrophages to said glial cell. 

1 69. A method of stimulating mitogenic activity In a 

2 glial cell, said aethod comprising applying a 25 JcD 

3 polypeptide factor isolated from bovine kidney to said glial 

4 . cell. 

1 70. A aethod of stimulating aitogenic activity in a 

2 glial cell, said aethod comprising applying ARIA polypeptide 

3 to said glial cell. 

1 71. A polypeptide factor having glial cell 

2 mitogenic activity and including an amino acid sequence 

3 encoded by:- 

4 (a) a DNA sequence shown in any one of Figures 28a, 

5 28b or 28c (SEQ ID Nos. 133-135, respectively). 

6 (b) a DNA sequence shown in Figure 22 (SEQ ID Ho. 

7 89); 

8 (c) the DNA sequence represented by nucleotides 

9 281-557 of the sequence shown in Figure 28a. 

10 (d) a DNA sequence hybridizable to any one of the 

11 DNA sequences according to (a) , (b) or (c) . 
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1 72. A basic polypeptide factor having a aolecular 

2 weight, whether in reducing conditions or not, of from about 

3 30 kD to about 36 JcD en 8DS-polyaerylamide gel 

4 electrophoresis, said polypeptide factor having mitogenic 

5 activity stimulating the division of rat Schwann cells in 

6 the presence of fetal calf plasma, and when isolated using 

7 reversed-phase KPLC retaining at least 50% of said activity 
S after 10 weeks incubation in 0.1% trifluoroacetic acid at 

9 4»C. 

1 73. A basic polypeptide factor having a molecular 

2 weight, under non-reducing conditions, of froo about 55 kD 

3 to about 63 kD on SDS-polyacrylamide gel electrophoresis, 

4 said polypeptide factor having mitogenic activity 

5 stimulating the division of rat Schwann cells in the 

6 presence of fetal calf plasma,- and when isolated using 

7 reversed-phase KPLC retains at least about 50% of said 

8 activity after 4 days incubation in 0.1% trifluoroacetic 

9 acid at 4*C. 
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1 74. A method for the preparation of a polypeptide 

2 defined in claim 72 or claim 73, said method comprising 

3 extracting vertebrate brain material to obtain protein, 

4 subjecting said protein to chromatographic purification 

5 comprising hydroxylapatite EPLC end thereafter to SDS* 

6 polyacrylamide gel electrophoresis and collecting that 

7 fraction therefrom which has an observed molecular weight of 

8 about 30 kO to 36 kD and/or that fraction which has an 

9 observed molecular weight of about 55 kD to 63 XD if , in 

10 either case, subjected to SDS-polyacrylamide gel 

11 electrophoresis; in the case of said smaller molecular 

12 weight fractions whether in reducing conditions or not, and 

13 in the case of said larger molecular weight fraction under 

14 non-reducing conditions, and which fraction (s) exhibit (s) 

15 mitogenic activity stimulating the division of rat Schwann 

16 cells against a background of fetal calf plasma. 



1 75. A method as claimed in claim 74, wherein the 

2 brain material in said method is pituitary material. 

1 76. A method as claimed in claim 75, wherein said 

2 pituitary material in said method is bovine. 

1 77. A method as claimed in claim 74, wherein said 

2 protein used in said method is initially extracted from 

3 brain material is first subjected to carboxymethyl cellulose 

4 chromatography. 

1 78. A method as claimed in claim 74 wherein after 

2 said hydroxylapatite HPLC, said method uses cation exchange 

3 chromatography, gel filtration, and/or reversed-phase HPLC. 
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1 79. A aethod as el a iced In claim 74, vherein at 

2 each stage of said aethod biological activity 'of Material 

3 obtained is assessed for aitogenie activity stimulating the 

4 division of rat Schwann cells in the presence of fetal calf 

5 plasma. 

1 SO* A aethod for assaying a substances for glial 

2 cell aitogenic activity, said aethod comprising contacting 

3 said substance vith glial cells in the presence of .fetal 

4 calf plasma, and the aeasuring DNA synthesis in said glial 

5 cells as a measure of glial cell aitogenic activity. 

1 81. An assay as claimed in claim SO, vherein said 

2 glial cells ere Schwann cells. 

1 62. A DNA sequence encoding a polypeptide having 

2 glial cell aitogenic activity and comprising: 

3 (a) a DNA sequence shown in any one of Figures 28a, 

4 28b, or 28c (SEQ ZD KOS. 133-135) 

5 (b) a DNA sequence shown in Figure 22 (SEQ ID Ho. 

6 89); 

7 (c) the DNA sequence represented by nucleotides 
S 2 8 1-557 of the sequence shown in Figure 28a; or 

9 (d) a DNA sequence hybridizable to any one of the 

10 DNA sequences according to (a), (b) or (c). 

1 63. A polypeptide vhieh is a. glial cell mitogen, 

2 said polypeptide being encoded by a DNA sequence as defined 

3 in claim 82, said polypeptide obtained by a aethod 



4' comprising for the preparation of a glial cell aitogenic 

5 factor, said aethod cultivating aodified host cells under 

6 conditions permitting expression of said DNA sequence. 
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1 84. A vector comprising_a DNA sequence as defined 

2 in claim 82. 



1 85. A host cell containing the isolated DKA of 

2 claim 84. 

1 86. A method for the preparation of a glial call 

2 mitogenic factor, said method comprising cultivating 

3 modified host cells as defined in claim 85 under conditions 

4 permitting expression of said DNA sequence. 

1 87. A polypeptide which is a glial cell mitogen, 

2 said polypeptide being encoded by a DNA sequence as defined 

3 in claim 1, said polypeptide obtained by a method comprising 

4 for the preparation of a glial cell mitogenic factor, said 

5 method cultivating modified host cells under conditions 

6 permitting expression of said DNA sequence. 

1 88. A polypeptide vhich is a glial cell mitogen, 

2 said polypeptide being encoded by a DNA sequence as defined 

3 in any one of claims 18-22, said polypeptide obtained by a 

4 method comprising for the preparation of a glial cell 

5 mitogenic factor, said method cultivating modified host 

6 cells under conditions permitting expression of said DNA 

7 sequence. 

1 89. A method for detecting, in a sample, the 

2 presence of a molecule having a receptor binding 

3 characteristic of a polypeptide defined in any one of claims 

4 23, 40-46, 71-73, or 87, said method comprising the steps of 

5 ' a) contacting said sample with a polypeptide of any 

6 cne of claims 22, 39-42, 63-65, 72, 73 or 80, along with a 
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7 receptor capable of binding specifically to said 

8 polypeptide, .and 

9 b) detecting competitive inhibition of the binding 

10 of said polypeptide to said receptor as an indication of the 

11 presence of a receptor binding molecule in said sample. 

1 90* A method for the prophylaxis or treatment of a 

2 glial tumor in a patient, said method comprising 

3 administering to said patient an effective amount of a 

4 substance vhich inhibits the binding of a factor as defined 

5 in any one of claims 23, 40*46, 71*73, or 87 to a receptor 
€ therefor. 



1 91* A pharmaceutical or veterinary formulation 

2 comprising a polypeptide as defined in any of claims 23, 40- 

3 46, 71-73, or B7 formulated for pharmaceutical or veterinary 

4 use, respectively, together with 'an acceptable diluent, 

5 carrier or excipient and/or in unit dosage form. 

1 92. A method for stimulating mitogenesis of a glial 

2 cell, said method comprising contacting said glial cell vith 

3 a polypeptide as defined in any one of claims 23, 40-46, 71- 

4 73, or 87. 

1 S3. A polypeptide, as defined in any one of claims 

2 23, 40-46, 71-73, or 87 for use as a glial cell mitogen* 

1 94* A method for stimulating mitogenesis of a glial 

2 cell in a vertebrate, said method comprising contacting said 

3 glial cell vith an effective amount of a polypeptide defined 

4 in any one of claims 23, 40-46, 71-73, or 87 to glial cells. 
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1 95. A method for the prophylaxis or treatment of 

1 pathophysiological condition of the nervous system in a 

3 »ammal in which said condition involves a cell type which is 

4 sensitive or responsive to a polypeptide as defined in any 

5 one of claims 23, 40-46, 71-73, or 87, said method 

6 comprising administering an effective amount of said 

7 polypeptide. - 

1 96. A method for the treatment of a condition which 

2 involves peripheral nerve damage in a mammal, said method 

3 comprising contacting said peripheral nerves with an 

4 effective amount of a polypeptide, as defined in any one of 

5 claims 23, 40-46, 71-73, or 87. 

1 97. A siethod for the prophylaxis or treatment of a 

2 condition in a mammal in said condition involves 

3 demyelination or daaage or loss of Schwann cells, for 

4 example a neuropathy of sensory or motor nerve fibers, said 

5 method comprising contacting said Schwann an effective 

€ amount of a polypeptide, as defined in any one of claims 23, 

7 40-46, 71-73, or 87. 

1 98. A method for the prophylaxis or treatment of a 

2 neurodegenerative disorder in a mammal, said method 

3 comprising contacting glial cells in a mammal with an . 

4 effective amount of a polypeptide as defined in any one of 

5 claims 23, 40-46, 71-73, or 87. 

2 99. A method for inducing neural regeneration 

2 and/or repair in a mammal, said method comprising contacting 

3 glial cells in a mammal with an effective amount of a 

A polypeptide as defined in any one of claims 23, 40-46, 71- 

5 73, or 87. 
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1 100. A method of inducing fibroblast proliferation, 

2 said nethod comprising contacting said fibroblasts with a 

3 polypeptide, as defined in any one of claims 23, 40-46, 71- 

4 73, or 87. * 

1 101. A method of wound repair in mammals, said 

2 method comprising contacting said wound with a polypeptide, 

3 as defined in any one of claims 23, 40-46, 71*73, or 87- 



102. A method of making a medicament comprising 
admixing a polypeptide as defined in any one of claims 23, 
40-46, 71-73, or 87 with a pharmaeeutically acceptable 
carrier. 

103. A method for producing an antibody, said method 
comprising immunizing a mammal with a polypeptide of any one 
of claims 23, 40-46, 71-73, or 87. 



1 104. A method for detecting, in a sample, the 

2 presence of a molecule having a receptor binding 

3 characteristic of a polypeptide defined in any one of claims 

4 23, 40-46, 71-73, or 87, said method comprising the steps of 

5 a) contacting said sample with a polypeptide of any 

6 one of claims 23, 40-46, 71-73, or 87, along with a receptor 

7 capable of binding specifically to said polypeptide, and 

8 b) detecting competitive inhibition of the binding 

9 of said polypeptide to said receptor as an indication of the 
10 presence of a receptor binding molecule in said sample. 

1 105. A method for detecting a receptor which capable 

2 of binding to a polypeptide as defined in any one of claims 

3 23, 40-46, 71-73, or 87, said method comprising carrying out 



1 
2 
3 
4 

1 
2 
3 
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4 affinity .isolation en said sample using a said peptide as 

5 the affinity ligand. 

1 106. A method for the prophylaxis or treat&ent of a 

2 glial tumor in a patient, said method coaprising 

3 administering to said patient an affective amount of a 

4 substance which inhibits the binding of a factor as defined 

5 in any one of claims 23, 40-46, 71*73, or 87 to a receptor 
€ therefor. 



1 107. A peptide selected from the following:- 





2 


FKGDAHTE 




3 


ASLADEYEYHXK 




4 


TEIESSGLHK 




5 


ASLADEYEYHRK 




6 


AGYFAEXAR 




7 


TTEHASEQ6A 




e 


AKEALAALK 




9 


F V I C A K K 




10 


ETQPDPGQ1LKKVPKVIGAYT 




11 


EYXCLXFXWFXXATVM 




12 


EXKFHP 




13 


XLEFLXAK 




14 


VBQVK.AAK 




15 


YIFFMEPEAXS6G 




16 


LCAWGFFAFPVXY 




17 


KFVVIEGK 




16 


ASFVSVGSVQELVQR 




19 


VCLLTVAALFPT 




20 


KVHCVWAAK 




21 


XASLADSGEYMXX 




22 


DLLLXV 
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1 108. A DNA sequence as shown in any one of Figures 

2 28a, 28b and 28c (SEQ ZD No. 133-135, respectively). 

1 109. A polypeptide encoded by a DNA sequence as 

2 defined in dais 108 (SEQ ZD Kos. 133-135). 

1 110. An antibody to a polypeptide as defined in 

2 clair 107. 

1 ill. A method of investigating, isolating or 

2 preparing a glial cell mitogen or gene sequence encoding 

3 said glial cell mitogen, said method comprising contacting 

4 tissue preparations or samples vith an antibody, said 

5 antibody prepared as defined in claim 103. 

1 112. A method for isolating a nucleic acid sequence 



2 coding for a molecule having glial cell mitogenic activity, 

3 said method comprising contacting a cell containing sample 

4 vith a glial cell mitogen specific antibody to determine 

5 expression of said mitogen in said sample and isolating said 

6 nucleic acid sequence from the cells exhibiting said 

7 expression. 

1 113. The purified GGF2 polypeptide comprising the 

2 a&ino acid sequence shown in Fig. 45 herein (SEQ ZD Ho. 

3 167). 

1 114. A purified CSF2 DNA encoding the 6SF2 

2 polypeptide whose sequences is shown, in Fig. 45 (SEQ ZD Ko. 

3 167). 

1 115. A method for inducing myelination of a neural 

2 cell by a Schwann cell, said method comprising contacting 
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3 said Schwann cell with a polypeptide of any one of claims 

4 23, 40-46, 71-73, or 87. 



1 116. A method for inducing acetylcholine receptor 

2 synthesis in a cell, said method comprising contacting of 

3 said cell vith a polypeptide of any one of claims 23, 40-46, 

4 71-73, or 87. 

1 117. An antibody to a polypeptide as defined in 

2 claim 23. 

1 118. An antibody to a polypeptide as defined in 

2 claim 40. 

1 119. An antibody to a polypeptide as defined in 

2 claim 41. 

1 120. An antibody to a polypeptide as defined in 

2 claim 42. 

1 121. An antibody to a polypeptide as defined in 

2 claim 43. 

1 122. An antibody to a polypeptide as defined in 

2 claim 44. 

1 123. An antibody to a polypeptide as defined in 

2 claim 45. 

1 124. An antibody to a polypeptide as defined in 

2 claim 46. 
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1 125. An antibody to a polypeptide as defined in 

2 elaim 71. 

1 126. An antibody to a polypeptide as defined in • 

2 claia 72. 

1 127. An antibody to a polypeptide as defined in 

2 claim 73. 

1 128. An antibody to a polypeptide as defined in 

2 dais 67. 

1 129. A method of purifying a protein vith glial cell 



2 mitogenic activity, said method comprising contacting a cell 

3 extract vith an antibody of any one of elaims 117-128. 

1 130. A method of treating a mammal suffering from a 

2 disease of glial cell proliferation, said method conprising 

3 administering to said mammal an antibody of any one of 

4 claims 117-128. 

1 131. A vector conprising a DNA sequence as defined 

2 in any one of claims 1 or 18-22. 
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Abstract cf the Disclosure 
Disclosed is the characterization and purification 
of DNA encoding a numerous polypeptides useful for the 
stimulation of glial cell (particularly, Schwann cell) 
mitogenesis and treating glial cell tumors. Also disclosed 
are DNA sequences encoding novel polypeptides which may have 
use in stimulating glial cell mitogenesis and treating glial 
cell tumors. Methods for the synthesis, purification and 
testing of both known and novel polypeptides for their use 
as both therapeutic and diagnostic aids in the treatment of 
diseases involving glial cells are also provided* Methods 
are also provided for the use of these polypeptides for the 
preparation of antibody probes useful for both diagnostic 
and therapeutic use in diseases involving glial cells. 
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(3) Luisa Minchetti frh'^ Uft £i j 4 / 1 i 

Full Name /Third Inventor Sign^tufe Date 

Residence: Via Stradello. 22 . 

48012 Baanacavallo fRA) Italy Italian 

Citizenship 

Post Office Address: Via Stradello. 22 . 

48012 Baanacavallo fRA) Italy 



(4) Michael Waterfield 1 : _ 

Full Name /Fourth Inventor Signature Date 

Residence: Chantemerlc. Speen La ne. Speen, 

Newburv. Berkshire RG13 1RN British ^ 

United Kingdom Citizenship 

Post Office Address: Chantemzrlc. Speen Lane. Spe en. Newbury 

Berkshire RG13 1RN. United Kingdom 



(5) Mark Marchioni 

Full Name/Fifth Inventor Signature Dat< 

Residence: 24 Twin Circle Drive _ 

Arlington. Massachusetts 02174 U.S.A. 

Citizenship 

Post Office Address: 24 Twin Circle Drive 

Arlington. Massachusetts 02174 _ 



(6) Mario Su Chen — 

Full Name/Sixth Inventor Signature Dati 

Residence: 65 Decatur St. 

Arlington. Massachusetts 02174 U.S.A. 

Citizenship 



Post Office Address: 65 Deca tur St. 

Arlington. Massachusetts 02174 
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(7) Ian Kiles ^^J^T '1- j - Y'fc 

Full Name /Seventh Inventor Signature Date 

Residence: 91 Riding House St. 

London W1P 8BT* England British 

Citizenship 

Post Office Address: Courtauld Building. 91 Riding House St. 

London W1P 8BT* England 
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Paver of Attorney ^ ^ . . . . . . 

I hereby appoint the following attorneys to prosecute this application and 
transact all business in the Patent and Trademark Office connected therewith: 
John E. Lynch, Reg. No. 20,940; Peter F. Felfe, Reg. No. 20,297; Alfred H. 
Hemingway, Jr., Reg. No. 26,736; Vincent M. Fazzari, Reg. No. 26,879; Hallie 
R Levie Reg. No. 31,116; Charles A. Blank, Reg. No. 17,419; Norman D. 
Hanson, Reg. No. 30,946; Walter G. Weissenberger, Reg. No. 17,344; F. Br ice 
Faller, Reg. No. 29,532; Andrew L. Tiajoloff, Reg. No. 31,575; John P. 
Luther, Reg. No. 32,261; Christine H. Tsai, Reg. No. 34,266 and John A. 
Bauer, Reg. No. 32,554, my attorneys with full power of substitution and 
revocation. Address all telephone calls to Christine H. Tsai, at (212) 688- 
9200. Address all correspondence to: 

FELFE & LYNCH 

805 Third Avenue 

New York, New York 10022 

I hereby declare that all statements made herein of my own knowledge are true 
and that all statements made on information and belief are believed to be 
true; and further that these statements were made with the knowledge that 
willful false statements and the like so made are punishable by fine or 
imprisonment, or both, under Section 1001 of Title 18 of the United States 
Code and that such willful false statements may jeopardize the validity of 
the application or any patent issued thereon. 



fl) Andrew Good earl __ . 

Full Name /Sole or First Inventor Signature Daze 

Residence: 45 Blacketts Wood Drive. _ ..... 

Chorlevwood. Hertfor dshire. WD3 SPY British 

nrW*«rf Kingdom Citizenship 

Post Office Address: 45 Blacketts Wo od Drive. Chorleywood, 

Hertfordshire. WD3 5PY . United Kingdom 



(2\ Paul stroobant ; — . . 

Full Name /Second Inventor Signature Daze 

Residence: 52 A Ceci l e Park. Crouch End 

London N8 9 AS. United Kingd om New Zealand 

~ Citizenship 

Post Office Address: 52 A Cec il e Park. Crouch End _ _ 

London N8 9 AS. United Kingdom 
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(3) Luisa Minghetti 

Full Name/Third Inventor Signature Date 

Residence: Via Stradello* 22 

48012 Bagnacavallo (RA) Italy Italian 

Citizenship 

Post Office Address: Via Stradello, 22 

48012 Bagnacavallo (RA) Italy 



(4) Michael Waterfield 

Full Name /Fourth Inventor Signature Date 

Residence: Chantemerlc, Speen Lane. Speen. 

Newbury, Berkshire RG13 1RN British 

United Kingdom ' Citizenship 

Post Office Address: Chantemerlc. Speen Lane. Speen f Newbury 

Berkshire RG13 lRN f United Kingdom 



(5) Mark Marchioni 



Full Name/Fifth Inventor Signature 'Date 

Residence: 24 Twin Circle Drive 

Arlington. Massachusetts 02174 U.S.A. 

Citizenship 

Post Office Address: 24 Twin Circle Drive 



Arlington, Massachusetts 02174 



(€) Mario Su Chen 

Full Name/Sixth Inventor " ^Signature Date 
Residence: 65 Decatur St. 



Arlington. Massachusetts 02174 U.S.A. 

Citizenship 

Post Office Address: 65 Decatur St. 



Arlington. Massachusetts 02174 
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f7) Tan Hiles 

Full Name/Seventh Inventor ' Signature ' Date 

Residence: 91 Riding House St. 

London W1P bbt. England British 

Citizenship 

Post Office Address: Courtauld Building. 91 Riding House St. 

London W1P 8BT. England 
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Appl ic ant /Patentee : Andrew Goodearl et al. 

Serial/Patent No.: 08/036,555 

Filed/Issued: March 24 , 1993 

For* GLIAL MITOGENIC FACTORS , THEIR PREPARATION AND USE 



Attorney's Docket No.: LUD 250+4-JEL/CHT 

VERIFIED STATEMENT (DECLARATION) CLAIMING SMALL ENTITY STATUS 
(37 CFR 1.9(f) and 1.27(e)) - SMALL BUSINESS CONCERN 

I hereby declare that I am 

( ) the part-owner of the small business concern identified below: 
(X) an official of the small business concern empowered to act on behalf of the 
concern identified below: 

NAME OF CONCERN Cambridge NeuroScience 

ADDRESS OF CONCERN One Kendall Square 

Cambridge, Massachusetts 02139 

I hereby declare that the above identified small business concern qualifies as a small 
business concern as defined in 13 CFR 121*3-18, and reproduced in 37 CFR 1.9(d), for purposes 
of paying reduced fees under section 41(a) and (b) of Title 35, United States Code, in that 
the number of employees of the concern, including those of its affiliates, does not exceeded 
500 persons. For purposes of this statement, (1) the number of employees of the business 
concern is the average over the previous fiscal year of the concern of the persons employed 
on a full-time, part-time or temporary basis during each of the pay periods of the fiscal 
year, and (2) concerns are affiliates of each other when either, directly or indirectly, one 
concern controls or has the power to control the other, or a third party or parties controls 
or has the power to control both. 

I hereby declare that rights under contract or law have been conveyed to and remain with the 

small business concern identified above with regard to the invention, entitled 

GLIAL MITOGENIC FACTORS, THEIR PREPARATION AND USE 



by inventor (a) Andrew Goodearl et al. 

described in 

( ) the specification filed herewith 

(X) application Serial No, 036,5*5 filed March 24, 1993. 
{ ) Patent No* , issued 

If the rights held by the above identified small business concern are not exclusive, eaci 
individual, concern or organization having rights to the invention is listed below* and nc 
rights to the invention are held by any person, other than the inventor, who could not 
qualify as a small business concern under 37 CFR 1.9(d) or by any concern which would not 
qualify as a small business concern under 37 CFR 1*9 (d) or a nonprofit organization unde: 
37 CFR 1.9(e)* *NOTE: Separate verified statements are required from each named person 
concern or organization having rights to the invention averring to their status as smai: 
entities. (37 CFR 1.27). 

Cont'd 

FELFE £ LYNCH 
S05 Third Avenue 
New York, New York 10022 
(212) 688-9200 



Verified Statement /Declarati on! Clai»j pa saall Bptitv Btatu» 
.(37 CFR 1.9(f) and 1.27 fell - Saall Subnets Con»n, 

Page 2 

Applicant /Patentee: Andrew Goodearl et al 

Serial/Patent Wo.; U8/036.555 
Filed/Issued: Ma f e h 24. 1093 

Pors GLIAL HITOGENIC FACT ORS, PREPARATION AWn TlfiP 

Attorney's Docket Wo.: Ltro 250.4-jel/cht 



NAM E 
ADDRESS 



( ) INDIVIDUAL ( ) SMALL BUSINESS CONCERN ( ) NONPROFIT ORGANIZATION 



NAM E 
ADDRESS 



( ) INDIVIDUAL ( ) SMALL BUSINESS CONCERN ( ) NONPROFIT ORGANIZATION 



statS re^itino / f . Ue in thiS a ^ lication ~ P"ent, notification of any change in 
status resulting m loss of entitlement to small entity status prior to paying, or at the 

S^h ° f A aying ' the earlie8t of the i88 « *°* or any maintenance fee due l/ter t*l date on 
whxch status as a small entity is no longer appropriate. (37 CFR 1.28(b)). 

I hereby declare that all statements made herein of my own knowledge are true and that all 
statements made on information and belief are believed to be true? and furthest these 
^rSZrf with the knowledge that willful false statements and the like so ma" 
States Coc£ and JET* °' ^ priB °™* nt ' or both ' Section ^01 of Title 18 of the United 

h Willful falBe statements may jeopardize the validity of the 

Erected" 11 ' ^ Pat6nt iB8Uin9 there ° n ' " " y P * tent tC WhiC * thiB verified st*t ero ° e f nt is 



KAME OF PERSON SIGNING Gregory B. m.tl.r 

JiSrcc^If N ° THER **** OWNE R T>irl^r . T.L. j AffalrB p , tent Cftnnao1 

nuuRESS OF PERSON SIGNING One Kendal l Square 



Cambri dge. Massachusetts 02139 



SIGNATURE^ 
DATE 



FELFE & LYNCH 
805 Third Avenue 
New York, New York 10022 
(212) 688*9200 




Applicant /Patentee; And rev G o o d earl et al. : 

Serxal/Patent No.: 1 £ _ — 

Filed/Issued: March 24, 1993 

VfttM GLIAL MITOGENIC FATORS . THEIR PREPARATION AND USE 

Attorney's Docket wo.! LtTD 2S0.4-JEL/CHT . 

VERIFIED STATEMENT (DECLARATION) CLAIMING SMALL ENTITY STATUS 
(37 CFR 1.9(f) and 1.27(d)) - NONPROFIT ORGANIZATION 



I hereby declare that I am an official empowered to act on behalf of the nonprofit 
organization identified below: 

NAME OF ORGANIZATION Ludwio Institute for Can cer Research 

ADDRESS OF ORGANIZATIO N 1345 Avenue of the Americas 

New York. New York 10105 



TYPE OF ORGANIZATION 



( ) UNIVERSITY OR OTHER INSTITUTE OF HIGHER EDUCATION 

(X) TAX EXEMPT UNDER INTERNAL REVENUE SERVICE CODE (26 USC 501(a) and 501(c)(3)) 
( ) NONPROFIT SCIENTIFIC OR EDUCATIONAL UNDER STATUTE OF STATE OF THE UNITED STATES 
OF AMERICA 

(NAME OF STATE > 

(CITATION OF STATUTE — > 

( ) WOULD QUALIFY AS TAX EXEMPT UNDER INTERNAL REVENUE SERVICE CODE (26 USC 501(a) 
and 501(C)(3)) IF LOCATED IN THE UNITED STATES OF AMERICA 

( ) WOULD QUALIFY AS NONPROFIT SCIENTIFIC OR EDUCATIONAL UNDER STATUTE OF STATE OF 
THE UNITED STATES OF AMERICA IF LOCATED IN THE UNITED STATES OF AMERICA 

(NAME OF STATE — > 

(CITATION OF STATE ) 

I hereby declare that the nonprofit organization identified above qualifies as a nonprofit 
organization as defined in 37 CFR 1.9(e) for purposes of paying reduced fees under section 
41(a) and (b) of Title 35, United States Code with regard to the invention entitled 
GLIAL MITOGENIC FACTORS , THEIR PREPARATION AND USE _ _ 



by inventor (s) Andrew Goodearl et al. * 



described in 



( ) the specification filed herewith 

(X) application Serial No. 036 , 555 • filed March 24, 1993 
( ) Patent No. , issued 

I hereby declare that rights under contract or law have been conveyed to and remain 
nonprofit organization with regard to the above identified invention. 



Cont'd 



FELFE & LYNCH 
805 Third Avenue 
New York, New York 10022 
(212) 688-9200 



Verified Statement (Declaration) Claiming Small Entity Status 
37 CFR 1.9(f) and 1.27(d)) - Nonprofit Organisation 
Page 2 

Appl icant /Patentee : Andre w Goodearl et al. 

Serial/Patent No.: 036,555 

Filed/Issued: March 24, 1993 

For; GLIAL MITOGENIC FACTORS , THEIR PREPARATION AND USE 



Attorney's Docket No.; LUD 250.4-JEL/CHT 

If the rights held by the nonprofit organization are not exclusive, each individual, concern 
or organisation having rights to the invention is listed below* and no rights to the 
invention are held by any person, other than the invention, who could not qualify as a small 
business concern under 37 CFR 1.9(d) or by any concern which would not qualify as a small 
business concern under 37 CFR 1.9(d) or a nonprofit organization under 37 CFR 1.9(e)* *NOTE: 
Separate verified statements are required from each named person, concern or organization 
having rights to the invention averring to their status as small entities. (37 CFR 1.27). 



NAME 

ADDRESS, 



( ) INDIVIDUAL ( ) SMALL BUSINESS CONCERN { ) NONPROFIT ORGANIZATION 

I acknowledge the duty to file, in this application or patent, notification of any change 
in status resulting in loss of entitlement to small entity status prior to paying, or at the 
time of paying, the earliest of the issue fee or any maintenance fee due after the date on 
which status as a small entity is no longer appropriate, (37 CFR 1.28(b)). 

I hereby declare that all statements made herein of my own knowledge are true and that all 
statements made on information and belief are believed to be true; and further that these 
statements were made with the knowledge that willful false statements and the like so made 
are punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the United 
States Code, and that such willful false statements may jeopardize the validity of the 
application, any patent issuing thereon, or any patent to which this verified statement is 
directed. 



NAME OF PERSON SIGNING, 
TITLE IN ORGANIZATION 



Llovd J. Old 



Director 



ADDRESS OF PERSON SIGNING 



1345 Avenue of the Americas 



New York, New York 10105 



"^Igi&t^e 



Udate * 



NAME OF PERSON SIGNING.. 
TITLE IN ORGANIZATION 



Edward A. McDermott, Jr. 



Secretary and General Counsel 



ADDRESS OF PERSON SIGNING 



1345 Avenue of the Americas 



SIGNATURE V 



New York, New York 10105 



DATE 



FELFE & LYNCH 
805 Third Avenue 
New York, New York 10022 
(212) 688*9200 



COUNTS 1-125 




I 



Counts 1*125 

x 

e 8 1 § • I £ 
c 4 i I i I i I i ' • * 

6- 




Counts M25 




i 



Counts 1-125 




Counts 1-125 




_ # a* •« »• 

Vi h •* 71 7> 



22222222*222222 5 

S5S33S3S838S23S 2 




m 
3 

■4 

i 



3 ! ! 



5 5 

i ! 

3 
3 
3 





M -» 



8 8" 



7 



m e 

* n 

r x 

w * 

r ■< 

X < 
91 



O *4 W 



H < 

o r 

n C 

e > 



r 

91 



> H > 

W « •< 

> X 

r > > 

> ca ft 

> « X 

r o > 

* o » 
> 



> H > 

W A W * 

r r o 

> w > o 

O Cft o > 

n » n x 

« ft •< H 

« r w n 

« x «< 

2 r s 

S X X 

91 91 



> 



u £ 

JB JB 

5 S 

1 I 



88 coooeoSS 

!i I!S!!S!I 



«* a « u 



%tf * V * W 



Rgure 11 



Trypsin peptides . 

GGF.JI01 m VHQVWAAK' • (sed id bo 

GGF-H02 K/R YIFFM EPEAXSSG (seq is ho 

GGF-U03 K/R LGAWG PPAFPVX-Y (SEQ is so 

GGF-H04 K/R WFVVIEGK* (SEQ is so 

GGF-1I05 K/R ALAAAG YOVEK* Histone HI (SB 5 ° "° 

GGF-1106 K/R LVLR* (SEQ is no 

GGF.II07 K/R XXYPGQITSN Trypsin (seq n> no 

GGF-II08 KttASPVSVGSVQElVQR # (seq zs mo 

GGF-1I09 K/R VCILTV AALPPT (seq d no 

GGF-II10 KttDLlLXV (seq id ho 



GF-ll 11 
GF-U 12 



Lysy5 Entfopeptidase-C peptides 
KVHQV WAAK* 
KASLADSGEYMXK* 



(SEQ ID K>: SI) 
(SEQ IS 60: 52) 



figure 12 



GGF-ll 01 
GGF-ll 02 
GGF-ll 03 
GGF-ll 04 
GGF-ll OS 
GGF-ll 09 
GGF-ll 11 
GGF-ll 12 



VHQVWAAK 

Yl FFMEPEAXSSG . 

LGAWGPPAFPVXY 

WFVVIEGK 

ASPVSVGSVQELVQR 

VCLLTVAALPPT 

KVHQVWAAK 

KASLADSGEYMXK 



(SEQ ZD 
(SEQ 19 
(SEQ ZD 
(SEQ ZD 
(SEQ ZD 
(SEQ ZD 
(SEQ ZD 
(SEQ ZD 



BO: 45) 
10: 46) 
10: 47) 
BO: At) 
BO: 49) 
BO: SO) 
BO: 51) 
BO: 52) 



B 

GGF-ll 10 



Novel Factor II Peptides • others 
DLLLXV 



(SEQ ZD BO: 53) 



Comparison of Brw J JSA and (1 25 O^dR courroriL ..leihod for the 
DNA synthesis a&uy in Schwtnn eel) culture! 




Mitojenic response of rat sciatic nerve Schwann cell toGGFs 




DNA synthesis in rat sciatic nerve Schwann cells 
and 3T3 fibroblasts in the presence of GGFs 




Mitogenlc response of BHK 21 C13 cells 
to FCS and GGFs 




Survival and proliferation of BHK21 C13 call 
microcultures after 45 hours In presence of GGFs 




Fig.f 1ST- 



M!tc;eni: response of C5 ceils to FCS 




Mitogenie response of C6 cells te aFGF and GGFs 
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S41 
542 
50 
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$45 
S4( 
5S1 
5CI 
S(t 
(09 
(10 
(49 
(SO 
CS1 
<l2 
(S3 
(S4 
CSS 
(S4 
(59 
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TTYJULRWHCXYOCHCXYKi 0C7X-X 

cATwxYTcmY7c«««ci «?x-a 
TcrcowwcmTowri wrz-w 
wrraiCW«cA«TCHat! «n«XJ 

CC&XTXXCCXTyOCXXCYW CCfX-17 

oo-gcccxxxcytowcxxci wrxx-x 

CCTKCiX7XXQCXC3\tteCXl M7XX-4 

TOf«3UUL*?XXeC5*«l GC7X-XX 

CCS«XXCH&CfWm«Ct GSfX-X4 

OOfSCrXWWYTCYTWSei «fX«l4 

TrmWCYtCXXCXXCSAAi «FX-XS 

TTrTOWYWYXAXXCMAI GSfX-XS 

TGXXQttCYTCYWXXei GS7XX-4 

TCKXCYUrKYWKXC! CCfXI-f 

CX75tfXYTQ*C»GX*7CX«e! OC7XX-M 

cxmxYrcwcacmowsi «rxx-u 
ns>j*TcswxA#«xsGcmi «rxx-X2 

XGXRTOSCXXffXGWGCYTr! «fXX-X2 
fcCTSTCXCCYXAHCXHSCrrii OCfXX-W 
^CTTrTW&OaSKGXHOCmi «rXX-12 

Hsxjncx&crwjtcwocmi «rxx*« 
KOjjtrcx&cxxcRciiwcmi W7XX*X8 
RCTWQf&CYXAXcwocrni «rxx-w 
*cT*cTswouG*cwocrrr-i osrxx-xi 

ACXXWfiXRXWOCrOWfiX! CC7X-13 
AOttO<GXAXW«X5YW*l 
GXYCXJWTWWCK&OttA! «fIX»l 

rrYCTwcTXxnsxwoaxi «rxx-4 

XA*GO«AYfr»CXYAQ*Si.l W^X-l 

oTx^f?unw-^*ixjtni °*" x r! 

KXCrrTYTDCXRSXmGWCi GC71-X7 



(SEQ IS 
(SEQ IS 
(SEQ XS 
(SEQ XS 
(SEQ XS 
(SEQ tD 
(SEQ XS 
(SEQ XS 
(SEQ D 
(SEQ XS 
(SEQ IS 
(SEQ IS 
(SEQ XS 
(SEQ XS 
(SEQ XS 
(SEQ XS 
(SEQ XS 
(SEQ XS 
(SEQ XS 
(SEQ XS 
(SEQ XS 

(SEQ XS 
(SEQ XS 
(SEQ EC 
(SEQ XS 
(SEQ XS 
(SEQ XS 
(SEQ XS 
(SEQ XS 
(SEQ XS 
(SEQ XS 
(SEQ XS 
(SEQ D 
(SEQ XS 



BO! 
K! 
10! 
■0: 
10: 
BO: 
WO: 

10! 

10: 
BO: 
BO: 
BO: 
BO: 
BO: 
BO: 
BO! 
BOi 
BO! 
BO: 
BO: 
BO: 

BO: 
BO: 
BO: 
BO: 
BO: 
BO: 
BO: 
BO: 
BO: 
BO: 
BO: 
BO: 
BO: 



54) 
SS) 
SO 
$7) 
58) 
59) 
40) 
41) 
42) 
43) 
44) 
45) 
46) 
47) 
44) 
(9) 
70) 
71) 
72) 
73) 
74) 

75) 
7«) 
71) 
79) 
40) 
41) 
42) 
43) 
4*) 
IS) 
44) 
47) 
44) 
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(SEQ W HO: IS) 



KX M2XXU Kl nc?Cft I X» flCTOl XX 



nam 2: 



el 



«n«r»tt POL priMrs 

CCGUTTC , TG^OW«XYKa«HaA*rXYXWl *• 
CCCUrTCraxttO<«XYXTYea<SX*TXYXTI 

XAWAJiXtMAQOOWmYraCCWtTRTCl 
CCCUnC?«A«Xy CXXCTW«««Xfl«XX ! 
CWXXTTCT0CAfiXtSmfmXT«XRCC»6XM 1 
CCCXXnCTJOkO««XCCMCOWQ<TrYCQ*« 1 
C«JUTTC1WCT«WCTWT*XTBGA*M I 
XXMXTCCr«X5YrWMWC«a>CX«TCW« 1 
XA«XTCCT«X5WT«^Y?CCXrWL«AA! 

AA«x«crau^««RXJO*«raQ**afeei 
xA&sxTccr«AcrrTXCCYKriXTxxcxxcMxei 

CATWXYTOttA YW»«AA04ArCCT«Ae 1 
C«XXTTCT0CACXX>«KCXY<5OtCXYXCXCX! 

ec»fecYXX>««YJiCYTWMAxe4XTecwewi 

TCX 5CWARTXXCCX«XX«XTCCWCXC J 



ecrx-17 


(SEQ IB 10 


: fC) 




(SEQ IS BO. 


! 11) 


eerxx-i: 


(SEQ D BO: 


! «) 




(SEQ ZB 13; 


if 3) 


«m-i2 


(SEQ ZB BO; 


: t4) 


CCfZW2 


(SEQ D Mi 


: *S) 


«nw 


(SEQ ZB BO: 


»4) 


«rzx«a 


(SEQ ZB BO: 


. 17) 


e«nx«j 


(SEQ ZB BO: 


! te) 


ecnx-4 


(SEP XB BO: 


1 «) 


ccm-i 


(SEQ D BO: 


! 100) 


«m-a 


(SEQ D BO: 


! 101) 


OSf2I-3 


(SEQ XB BO: 


: 102) 


ecrxx-4 


(SEQ ZS BO! 


: 103) 


oerx-a 


(SEQ O BO: 


: 104) 


«rx»x 


(SEQ ZS BO: 


: 10S) 


«r:-i4 


(SEQ ZS BO: 


: 104) 


«f2-14 


(SEQ ZS BO: 


: 107) 


cerxx-i 


(SEQ ZS BO: 


: 104) 



CTni^ut PC* pri»«ri for fteter XX 

CI is© S«<7u&Ae« Cc»tnt 

711 CATC^TCT^WCT^nCW4X5XXTX?XT<WCX! 3» *ACX (SEQ ZS BO: 109) 

712 AJ^XTCC^JCA«CACXlCriXWTCCXCXTC5XTr! 3' tXa (SEQ ZS BO: 110) 

713 cccJUkr^r«atf7«xtcAOCUu«>J^xxwxaLt 3' racx (seq zs bo: hd 

721 CAK3ATC?«X«CTXJrr7r«7^TC\CTrr0C^ S» JACX (SEQ ZS BO: 112) 

722 XX04A*CCTC««AIAriCFXA^TCA£«AJrWl I' *XC1? XHCECJLZ3 (SEQ ZS 
72$ XXCO,TCC7«<1]l«^COCXCI)0«X3V XM4f X (SSQ ZB BO: 114) 
72 4 CC^riWO^SXACm^nXWAAXOCI XXC4T A (SEQ ZS BO: 115) 

771 CXTttXX*ttXMXGr&4^^ , XZC« 1+X «g ZS »: 114) 

772 XTX«CCO««^ACUt«A5«T^ak(^CCTGC8I (StQ ZS BO: 117) 

773 JUkC^^ec?C^5r^^>lUCC^«CACA4ACTCCTl WCSOMD (SEQ ZB BO: 114) 

774 ATAClX«<JCT^CXWXCArrTCXCACXCCZ«OTGA! ECOKS B*A (SEQ ZS »: 115) 



Summary of contijvMrt GGF-il eONA ftrwcJur « trtf if^jf'Oi 



p 71 1/713 



P 71 1/713 



•son e «ion eft «rai d 



2 



P 437/712 




p 711/713 



p771 



p*87 



pe7i 



p776 



p722 



P«7/712 




P4S7/712 



p€72 



p722 



P 671 



p722 



t 



672 



p773 



p773 



ricctr 24 



AXrrrtri preyed of tefr* Gtf-t 




newts 26 



PtftiC -»tlfi«4 it 4«4uc*4 u 



.14 s*^**etf 



PtpUdt fet . Stqutnct Mteh 

21-1 VHCVWXAJC 

1: XCVWAJJt MCU (SEQ tfi BO: 120) 



14: CC1J3 dflltV RLfiAV (SEQ Xfi BO: 121) 



xx-o: x4xv«?wirrTOf (seq id bo: 122) 

31: VLTVt l?tv;hHfP*€? RU3D (SEQ D BO: 123) 

xx-oj Yirnapxjocsw (seq a bo: «*) 

41: KIWI TirTX2?EAJ<SS« CKXL (SEQ D BO: 12S) 



XX-4 IVX* 

105: VAGSX tYlA CCT5S <"Q O BO: 126) 



X-lt mCLKTJWrXXXTVX (SEQ ZB BO: 127) 

113: CrrJS «y»iUfVvfXn?l*l SW» (SEQ ZD BO: 124) 

xi-12 jy^ykDscrrxxx (seq d bo: 129) 

151: ELMS KULADS6EYXCX VIS XX (SEQ D BO: 130) 



2-07 AJIADEYIttOJ (SEQ X» BO: 131) 

152: WIS* 4$:»d*5«i>ck VXSJX (SEQ » BO: 132) 



ncoz 27 



I I I t i i |i I t t < < • I I I t I M I I I I I H I M 

• •«•• • • • • • 

CtllCtfttlfir'tt'lA f t f U M M M I O M 

CMllClMHUHillHHlAMHflltll 
S C t i I • t 4 I A ■ t t iflllfftCillAIIifitC 

* I C V c t t t • 

• • • " * 



(SCQ ZD *0: 133) 



MHiiti fit UlliitftHAMIMMl 

• • « • • * • * * * 

UllllllltlMIIMl ' « « nMitruuM 
II IMMt(HMnil(ll>MillllHIII 
IIIUfUdtlllllHlMlM t-J_l I i • I t I 



jMiimtiMtimiiumimtiini 

stic I t t 4 t * ■ 1 t 1*1* LJ-I m t 4 » t II 



*^l^lllt*•••l«'•»«» i, •' ,lfi< * <t ' , 
#li4•ettl«»tl••t«l•*•■•ii»l*««^^• 

. . • • • • *• 

Tr-U^ w? l, "" J -"* ' , | ., int , l .. 1 ,....,.......^jjif>ftt^erAJAiraf!cmTea«HiJii 

(SEQ IK 10: ll*> 

ncox 281 



• • • • • * • , • * . 

i t t i iT TT •« i mittitmniiMiM 

. . . . • » « » » . 

tllKlllltlMIIMl • «»•!* » « t I » • « 

IHM MliHMilllUiMHdHUlH 
4 * i i i » i i t I t I • i » • • i « ' « * ' 1 i-i-I • i I • t i 
«>tiiii*.tt»«t«tit««**»** l,ll,, ** f * 



W t i V * ^ ■ m » ■ — — ■ ~ — — ^ • — - — » - - — — — - ' 

i c i • t ♦ t 4 i « i ' t i f i t L-i_i nnimmcj 

~ T ^ . . ^ ^^te^».iAfTiBigia?icr cugemircfl ct txiuimauuiun area m 

i rfr t ...T.. v .,t W :L^ wl ^meMTiI ^au <m t» 



triune 

(StQ ID »: 135) 

T15CU 2SC 




ricnx 29 



ncvu si 

COOZKC SrCKtXTS Of GLIAL C*OWTX fACTCVKIttCVLXK COTC 



CODIKC StCXtW ft (stQ 0 lu) 

eSCeOCTOCCtteCWAWWACCeCGGGCCWAeeCTAA^ 110 

?G«AWCttC«ACMA£«CACCSAtt«A5CG«ACC«(&6Cra 110 

CCAW^C9WCCAOOk««A«CACCC«WACnCCT«C^CC«at«5A«COCCC«CC 140* 

ACTCCeAKTGeceCGGAeCCCACGTXTOfCCCCGCGCm 900 

ccrccceeeacttcocccoccccTOW^TCo^ i«o 
litl Hllil! II III III II I I II III Hill 

C9C5»9C9CCte*?c?c55ecgctc?ctctc..cccctcs*s?g4e 
AAACnTrCraAAGCCCAWCttWCTWACCCAM 410 

lllllllllil li 1 1 1 1 f 1 1 Hllil IHH II Mill III 

k t z s t t e 

C^^«CCTCa?C^CAGCCTOCACrTCKO«CCAW 410 

ii nimmmiiiiii I i mi hiihhhi mini mini 

cg«?i ? c^tcc9Cft*5*gcgctc.c^tctcc9gc9*S4t9tceg*se9ci4»g4«g9e 

x c *eiee'xxo**e s e x * t v r . a * 

i linn liiiiiiii 1 1 1 1 1 1 1 1 mm nil imiiii i mini 

X x s * * 

a c e i $ f a 

ii iii in iiiimi 

CODIKS SKXXW II (5B5 ID K): 1J7) 

• • • ■ • • , ; 

CCGATcArtTCTWcCWCgAMMC95$6*CTr6AA6AA9CACrCGCraCTC 40 
KQV1fAAXACCLKXD«t,i*Vl 

eeCTtK^sCcWSsSCCAWCOCCTrCCCCTCcTjC^^ 120 
L C A « C *>AF»ieCXtX*DI* 

OCTACATCTfcTrcxwc^gecccAMCciACACCA^esgC^cCcfgce^ccnCefiA 110 

yzrrxsPCAysscepcKirs 

CCCTcCt7cCCCCC7CTCCAGAC6CeCCCSAACCTeAASAACSACgTCA0CCC'66T6CT6 240 
LlPPS*DC?tF«£O CepoAV 



ncuu 3i (con.) 

COOIKC «CXDfT I: (StQ O 10: 1M) 

it r* * i x *t x x s e t t * v- a c *t t & v- 

CCTTOCCTCCCCCCrTCJUUCGAGATCUCACTCAWACT^ < 0 

1 1 1 1 1 1 1 1 1 ! I II llllllllllllll il 1111! I! I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i I 
cett?ccueec;4tt;tii;t?it9m«?cc4?;44te??ct;ei9;Uee«44Ct49 

t t e* x t s *s t t i f t x'.r i v'n v e 

WCXTCOeTOCCA6AeCA0TK«U?ACTCCT«CTCJ^^ 150 

I 1 1 1 1 ! 1 1 1 II 1 1 1 1 i i S 1 1 It ! 1 1 ! 1 1 f ! I ! 1 1 1 1 lllllllllllllllllll 
tccttc55t5t9«»Acc*yttct9t»t*ctcctctctcft(j*ttC44ft9ftteAig*»t9 

s x ta xx'ixro w x x* x « x *t ? o 

CSAGTCAATTAAC^CCAMGAACAAACttCAJUUKCATC^ 17 1 

in nun t inn iiiiiiiiiiiiii liiiimm mi in n 

XX X • 



CODING SXGXEX? A: (StQ D 10: 139) 

x s k'l xxsxas'l a D *S G X Y X C X* 

lllllMlllilllllllll lillll tilllllllllillllilll lllllllllll 

9f44gte4944Ctte9e4tt«tei»»9c»tc«et99et94ttet99t9<9t«t4t9t9e44 
*** • V " r • •• « • r — 

v x x *x l c i D i A* 5 ax'x-xvxix" 

AG7SA?CA0CAAACTAGGAAA?SACAff?G*CTCTCWAACA?CA 120 

IIIIMIIIMII lllllllflllftfllllltfttll lllllill Mil! Hill 
4rt;4tc&(;c«44tt499444tf»»9t9eetct9ce4at4te4ce4te9t9944te444 

A 

CS 122 

II 

eg 

CODING SXGXXX? A' I (StQ O BO: 140) 

" X S X L X X* t X A *f i A D 

S C X* YXC'XVIS It t C* K D |'n } 1 » 

ATTCTC^ACAATATATCTCCWACTGATCAGCAAACTAOCAMTCXCACTOCCTCTCCCA 110 

ITX*VtS*HCJCRCtt*Al"fO$t 

ACATCACCXTTOTCCACTCAAA«5TAACACATCC«ACTCC6T6CTATnCTCACTCTC 2*0 

R c v : JC V C 5 H T • 

T>A:^--A/:r:A-:Aw>.::TA>:::::::Ar>.rTT-AjkTCAc::A,sc::::;:.;>jk::r:A: ::: 



tlCVU )1 (CC*T.) 



CODING SICttX? 6: (SBQ 10 »: Ui) 

I! TT*€KF*A ST t ! A t! T I f'tt PI 

llllll llllll lllllllllllllllll ill 1 f 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 
*9*tc*tc*ct9yt*tgcc«9CCtc»»ct?**99»9C»t*t5tftcttc»9»7tctcce« 

I c 
* X •* T • * *1 « * 5 T •* 1 

i i f i 1 1 1 1 1 1 1 1 1 1 1 1 iiiitiiii niiiii n 

tta9*»utc«?t*tcc»c*fa*??*?c*i*t*cttcttc*t 

CODING StCJOXT C: (SBQ » »: IM) 

T S H tVf'cA I *K K K 9 

i urn inn ii imiiiiiiiiiiinn n inn mmiinin 

Cttc»tct*c*tee*ce»ct99i*«*9 cc *tcttrt4***tftge9gi?n?g«?***» 

rev' Kcc'teritvK&'x,* if *> i i i 

1 1 1 1 1 1 1 1 1 1 mi 1 1 1 m I'UiJUUiiiliiiUUililJinUiU 

1 c 

llllllll 



ncsu 3i (corr.) 

CCDIKC SKXIiCT C/0: ($8} » XO: 142) 

* c o ic ft'cai'ct S * V > «' X V. * * 

McwccAAcc7^rrcAcrcacccxcAraACTSAGAAT«x*^ 

nmimimiminiiim imiMiiiiiiiiiiMiiiiimiiHii 

*»;t9CC4tcct9;tttc«ctg9i;eat9(t9-ttctgig*«t7tgccc»tftii9teeAA 

* Q X 

ACCCAACAA CS 

I IMIIII * ' 

X 

CODIKS StCXEXT C/D't (SB} O »: U3) 

X C ? i X f t* C * X *C 0 Jf Y V K k I f f " 

AAOTCCCCAMTCACTmCTWraTCSCTCCCAJU^ 

t i I f f I ! I f ! 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 II 1 iX! I U4 1 1 1 1 1 1 1 1 1 1 1 f 

«*7t9cec4ftftt9i^tUet;7t9«te9ct;ce*4t«ctfteft«»t99eet;ctteUe 
CODIKS StSXZXT 0: (SB} IS 10: 144) 

sT8T*f--i*fit*x«' 

AWACCTCCACTCWmV W ICICI OCC T S AATAO 3f 

miimimmiminiiijiiijiiiiii.. 



CODIKS SEOODTr D't (StQ » »: 145) 

XBtCXXF*XI 

te3c«teU99;ttt9*tttttt9gi9 37 



ncvu 31 (COTT.) 

COD2KC SrcXWT II (sb* ZD 10: Hi) 

* A Z I l.T Q t t V *t T Z T C Z C* Z A I * 

ii fiimmii (iiiiimiiiiiiii HM! mum iliiim n 

i v v *c i x c v v v" t c i! t i r o i i r . 

ii inn Miiiiiiiimiiiiii iiiiiiiiimiimiiii iiimiil 

t 8 B ft l * Q* f l * *• I ftVTIfX*yVA* 

CTl^TCAeW0CT7CC^CACCCTT C0C TCT g mCAM»eCATGAWAAC0TA0CC XI 0 

!! IMIIIII llllllllllflllilllllllll llllll IIIMMII i (It 

If C TilTt'ftf'l K V 5 I T n r ? ' 

ii Hin him ii ii ii minimi ii iiimiiiiiiiinum 

**t939cctcicc*tcct4*ccc*cccccc5ag6»tytec*<jct9rt5»»tc4it»c>ru 

S K K • V Z f l"« IZ*V 1 ft t A t t'f Ft* 

gCTAAJUaTOTeATCTCTAOCCAOgATATyorrCACACACAO^CCCACAO CICI ' nTlW 300 

1 1 1 1 i 1 1 1 iiiiiiii ii iiiminiiiiiiiiiiii ii mi ii iiiiii 

T S H T T « *" All's T TVfQT* Fit" 

A CCA 5TCA C7 ACACTI^CACAOCTCATCATTCCACTACTCKA CTCAGACTCCCACTCAC 310 

IIIIMIIIII Mill iltil Hill lllllliimill IIIIIIII M HI 
*ce&9te4et»t»ettec«c*9cce«te«etce«et«ct9tc4eee49«cteet*9ee«c 

S ¥SKCB**f«Z*Z Stilt V* Z V K * 

AGCTWACCJUTCCACACACTCAAAfrCATCATITCCSAA^ <:0 

iiiimiiiii 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 nil iimmiiiiii iiiimii 

*5ctq5iqciie95*e»«rt5*n5«tcctttcc?4**5cc4rtctgt*AtC7tg*t9 

- L 

s tYiysft*isi**¥ce*fcc°fttv # 

TCATCWA5AAMCA«A0^CA^5WXACI««O^ 410 

imiiiiiiiiiiiiiiiiiiiiiimm iiimimi iiiimiiiii in 

tC4tcc9ta;ttitc4rtag9eaM9ci<;cccakct99999cccta;a;9»c7tctt4tt 

COCTT^WCAWCWCCTCAAWAACA&CTTCCTCAOd^ 540 

III f t ! 1 1 i 1 1 ! It U ( 1 1 II It 1 1 1 i I J i i J 1 i 1 1 U i U 1 1 M f 1 1 ! 1 1 1 1 1 1 ! 

T 

SYRCSfHStR 

TCC7ACC5ACACTC7CCTCA.TACTGAAAG $€9 

I !!•••• 1 1 «||: !MI IIP iiiilill 



ncuu 31 (con.) 

CCD2HC StCXIXT *1 (SB} D »: Ul) 



T > « F t C * 



COCIKS StCXrXT X»t UBl » »: U7> 



iuiauuuuuiu»n auLU * u ^"'""""'T 

D 9 9 • • 

cIcc;cCCC6CWCC^6^^«f^],| ,,,,,,,, ,,,, 11111=* 



riCVU 31 (COKT.) 

& * K *T C A © S I *" f I f T l i f t y" 

inn mini n i 1 1 1 1 1 1 1 1 1 1 1 1 f f 1 1 it iiiiini milium 

CZ&V>rx.AXQ*K*L*AAStt AA* 

A«x:AAcmc«cmccT«caTACACAAcccecwaocacTcrccA6««« s<o 
Hi IIIIIMIIIMIIIlin INK milllllllllllllllll! Iflli | 
»9r^5*«9kt*c9cctttcct?;9C4t*c»9**ccccct99c*9CC*gxctt5*5gc4ic 

m ? '» t v m i t it m 'e c r i f « t" 

CCCT«CTTCC«CT«TCSACA0aWACTAACCCAACA(>MOOCTr^^ «00 

iimiiiiiiiiiii iiiiiMiiiiiiiiin iiiii i ii 1 1 1 1 i liiii 

*cct9CCttccoceto9Ctg»c*9C49?*ct44ecc45c»99ec9Cttctc^4C4c*994 
A AIT 

«to'Atti6VX # AlfO*D»XAV« 

ACUTT^OSCaOOCrCTCCOSWTAAT^ft 410 

nil i iniiiiim n mini ii iimmm ii iiim inn 

49444tcc49;ce4f9Ct9teU9t9t4vAtt9et44ec4494eeet4Ufiet7t«Ut«4 
CCCAAATACACCa?ASATTCACCWTAXAACTnATTTTA?ATAA?AM 720 

ii iiiii n 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 mil iimiiimiimii 

CCt444t444C4e4t494ttC4eCt;t««*4ettt4tttt4t4t44t4449ttUeetee 

TTWJkTTJJLACAA 735 

MMiiiimii 

tt444tt444C44 



flCDU 31 (COKT.) 



Eu&u Ceding S*ga*nt E: (SBQ ID 10: 163) 

ATCJusxTooc^coccccccgcgccocTccoo ^^ <o 

KRWRfcAPRftSCIlfCPftAQftP' 
CSAARSSPPLPILPLLLILC 

a cc«o«c ct»c oc cooooocooc8 0 ccoccaacga w no 

TAALAPGAAACMZAAPACAS 

C T CT^ ACT ttg r C CCC O CCCXO OSTOOSA T CO^ 240 
VCYSSPPSVCSVQXLAftAAA 

OTMT^TCSXKttjJLxOOT^CCCOCX ^^ 300 
VV 2ECHVEPCRRCCCALORK 

ccoocoscoocooco Mcc jidXc a j w 3 so 

AAAAAOEACAVCCBREPrAA 

cocccAi»»co c n oIjttcoKcco cecAesA xc^ w 7 c« cscc>jcooGACcgTs «20 

CPRALGPPAEEPLX.AAHCTV 
PSVPTAPVP«AOEPCIXAr* 

• • • • • • 

LVRVHQVVAV*ACCX,X*»SL 

ctcaccctscocctosxjc^^ coo 

tTV*X,C?KCB*AF*SCG* L'X 
EDSEY 2 FTKtPDAKSTSRAF 

GccoccTrccaxc ^ v Ii ^ ^ 7:0 

AAFKASFFFtX?C*NI,K*EV. 
SRVLCKKC 



W B K: I4|) riwu „ 

C6F2WFS nueltotidf ttqutftet *nd gtduetd prettln itojvunet 

ACrrTCCCCCCCCAACmtCCaACTCWGCTCC^ «c 
WCWWC^COCWTO^^CCWCCCCXCSCTUTCOCCTCW ISO 

CCA«0$CS«CCAGCAC«A©CCACCCCO^CnCCWC^ 340 
ACTCCCASST'OgCCCO<UC^ W n re CTCCCC O CCCrCC^ 360 

• • • • • • 

lAA lTmvVUV XAOCCOAtCCCACCCCrcO^ 420 

CGCCAWCCTCCSCOCACAWCT^CTWTCSO^ 410 

i s x t * t e 

xexexeoxxp*ciexx»?*>A 

CCrCSCSSCCCSASCCCAS C Vn ^i 'Iill l^ Ti a U^ «06 

a c c r i m n m i f i x t m i i 

CTOCCASSTTCCAAACTA U l w n W i ra AttACCAS^^ (40 
VACSXLVlfteSYS ftYfltf 

TTCAAW«frCAXC>AT^CTCAArrM*^ 720 

rxvrxxcitLfftxxxreixx 

A7ACACWuJo^C«^AA«aSAJk^^ 710 
ICXXPCXftLXXfXAXlA&l 

OvACXATATAWTCCAMCTCATCA0<iAAACTACCXAXTttCA 140 
C XYXCXVXfXLCXOfAtAIX 

ACCATICT«^CTCAMCSAttTCACCAC7COCAT5C« fOO 
t X YXillX** 0 ** ******* 

CT ^ lC^^lASASTCTCCCAHA^tAramT C ^ t<0 
▼ S ftf»X*Xt*t 

TCCA»TcdlcATCrrACA«W^CAAOCCATCTrC?^ 10*0 
S * S f •tACTSI LVKCAXIIX 

' ACmcWTSTSAAraAG^OTOCTT^TW:^^ ICIO 
T FCVHCCXCfKVIDLlIfit 

TACT^WCAACWCCCAAATCXCmACT««TCATCOCTOCCAAAACTA 1140 
YLCXCFUtf * b « DXCQKTVXA 

ACCrrc^ACACTACCTCCXCTCCCrrTCTCTCrC7CCCT5AATAWCCciT5CTCACT« 1200 
SfTflTXTfrJ-**-'*' 

07ccc3rrrrcrr5tt5tt5Cx:c::ccr::A5A7r::Ar:TAsx5C7x:xrc«rTTTA mo 



ricsu 3: (cdct.) 

TOACTlCCTCTffTCCCTCAWACTC«CTWXO«XCT« U|$ 

TSHTCTCJUUmJlTCHWmCTSt^n 1440 

CAATCACAATAMGCCCTKAAAACTCtCACTCT 1500 

C^ACACTCCCTCTTCTnATXAJ^WACCCTATCCm^ i$40. 

taacc^ctac»kcacttcaaatcatcc7aacttcc c itc oc 1(90 
acaj^taaa^caatajuujLaaamaamaaaa 1<3> 



(SCQ ID *>: U9) FICWU 33 

«mm «.i-tw. m*".- •«« freuln 



(SEQ 15 K: 150) TIC5JXX 3* 

oernm meliotU* «M >*•«•»« ••* J,B " 

i;ve««*** , * t ' T . 

■*cCCCAA5CTCCCKAACT»CCeseTJCC-«AATC...- ? % $ . . 



T7 

.. L 



ricm 3* (cokt.) 



«?C?CCXrcCC?CCA?«C«TCX<^ 1200 

vixfiKAViFFVi^iiifii*.. 

wwxcoccxcawoacwwxcutt^ 1210 

VTPF*fc*«** CKH . 

CCACIOettCCCCOCO^^ *«• 

eewcAceuTATox^ mo 

CACCAACAOttGCCW^^ 1440 

thss**a****p* 6H2 * b *\ 

OeAAATOGA^^eAC^^ 
XXD*K*6At«*** 1 * I * ,i '* # 

AAGACTXWXCXAGXTXCW^ 

CTAXAACCCXAATXCXCCCXTX^TTCAC^ "<° 
« 

TCCXCCrrxJLkTTXXACXXXXXM 17 «4 



1500 

CTC7CSX 1S60 
L S 

TTCTCTCC M20 
UIO 



ncuiz 35 



l(S£Q IB WO: 151) 
2(SCQ ID 1*0: 152) 
3(StQ IZ> HOI 153) 



TICUll 37 



7-1-1' 



f-b-x-c-c/p-d 

F-S-X-C-C/D-E 
F-8-1-C-C/&-B-1 
F-8-X««»«/&-E-X-X* 
F-B-X-C-C/£-D'«B . 
T-B-X««/&-D'«B»L 
? -B- X-C-C/D-&' -E-X-L 
F-8-X-C-C/D'-D 
F-B-l-C-C/B'-B 

F»8-x-c-c/&'»B-t 

f-b-x-c-c/b'-b-x-l 

f-i-x-c-c/o'-d'-i 

F-8-X-C-C/D'-D'-X-l# 
F-8-X«<-C/0*-D'-E-X-X< 

f-b-x-c-c/x/b'-d 
f-s-x-c-c/d-c/d'-b 
f-b-x-c-c/d-c/c'-b-l 

y . B~l-<-C/&-C/D ' -B-X-L 
F-B-X-C-C/D-C/D'O'-S 

f-b-x-c-c/&-c/o'-»'-b-i 
7 -b -x-c-c/d-c/d' •» ' •k"x« 



yix-i-A' 

7 •r»s-x«/D-B . 

F-X-B-X«C-«/&-B*X«lr . 

F-I-8-X-C-C/D-»'-I 
FiX-I-X-C-C/D-D'-B^L 

r»i-B-x-c-e/D-D'^B*& 
f-x-x-x-c-c/d'-d 

r-i-s-x-c-c/p'-E-i 

T«X»I-X«<-e/D'»I*X-l 
f»t»E-l-«/D'*»'-E 
r-X-B-X-C-C/D'-B'-B-X» 
7 ix-8-X«/& ' •B»X»X» 
F-Z-S-X<-C/C-«/&'-& • 

F-x-8-x-e</D-e/t.'-B 

F-X-B-X-<-C/D»C/D'-B«t 

F-X-B-X^</X/D'-B-X>L 

F-X-8-X-e-C/&-C/&'-e'-I 

j.X«B-X-C-C/D-C/D'-D'-B-X< 

F-X-B-X-C-C/B-C/D'«D'-B-X 



F-8-X-3-C-C/C-B 
F-B-X-C-C-C/t-B-L 

7 - b - x-« -e-e/a-s-x-x* 

7 - B- X-« -C -C/ P-D ' -X 

7-B«X-C-C-C/D-D'-E«l 

7- B-JL-S-C-C/D-D'-B-X-X, 

F-B-X-«-C-C/B'«D 

F-B-X-G-C-C/C'-l 

7-5-X-5-C-C/D'-E-t 

F-8-X-«-C»C/©'-B-X-l 

7-B-i.-C-C-C/D'-D'»B 

7 • 5- X-C-C-C/D ' »D ' -H-L 
F-B-X-O-C-C/D'-B'-B-X-l 

7- 8 -X-C-C-C/D-C/D'-D 
7 - 8 -X< -C-C/D-C/D' -E 
F-B-X-5-C-C/B-C/3'*B-X» 
7 - 8 - X-C-C-C/C-C/6' -B-X-L 
7-8-X-*-C-C/&-C/B'-C'-E 

7-B-x-o-c-c/&-c/B'-5'-f-i; _ 

7 - X-X-5 -C-C/B-C/O ' • 5 ' -B-X-L 



f-x-b-x-g-c-c/c-d 
t-x-x-x-s-c-c/d-b 

F-l-B-X<-««C/>E-t 
7-Z-B »X-« -C-€/>B«X»X» 
F-X-B-X-C-C-C/D-D'-B 
f-I-8 -X-C-C-C/3-&' -B-l» 
F»X-8*X«<-C-C/»-D'-B-X-X» 

F-X-B-X-C-C-C/D'-D 

y-X-B-i-C-C-C/P'-I 

7-X-B-X-^-C-C/C-E-L 

f-t-B-X-S-C-C/C'-B-X-X. 

F-X-B-X-C-C-C/D'-D'-B 

7-X-8-X-4-C-C/0 , -»'-E-L 

F-X-B-X-5-C-C/S'-t>'-B«X-X 

f-x-b-x«-c/&-c/&'-d 

7-E-B-X<-e-«/X/&'»B 
F-X-B-X-«-e-C/&-C/D'-B-l 

f-x-b-x-s-c-c/k/c-b-x- 

7-r-B-K-C-C/X/»'-D'H 
f-X-S-K-^-e/X/D'O'-J 
F-X-B-X-a-C-C/D-C/D'-D'-l 



?ICU*Z 37 (COKT.) 

oar/zx*2&Jii* ipuexxc vxsiajts coxtxxoid 
x-b-x' • 
x-b-x-«-c/>d 

I-!-JL-C-C/>I 

£-B-A-C-C/>E-X* 

X-B-i-C-C/C"-X-X-I 

X«8«A-C-C/C»D'-B 

X-8-A-C-C/>D'»8-L 

X-B-JL-C-C/D-fi'-K-X-i 

X-8-X-C-C/C-E 

z-b-x-c-c/d'-b-l 

x-b-x-c-c/d'-e-x-l 

x-B-x-c-e/»'*&'-x 

X-8-l-«-C/D'-&'-H-L : 

X-B-X-C-e/D'-©'-E-X-L 

X-B-X-C-C/X/D'-O 

X-B-X-C-C/X/C'-E 

X-S-X-C-C/X/D'-B-i 

X-B-A-C-C/X/D'-S-X»l 

X-8-A-C-C/X/»'-D'-B 

X- X -X -C-C/X/D ' -D ' -8-l» 

X-B-X-C-C/X/D'-D'-E-X-l 



X-X-X-C-C-C/X 

X-B-X-«-C-C/D-X 

X - 2 - A -« -C-C/D-B-L 

X-8-X-O-C-C/D-B-X-L 

X-B-X-C-C-C/X'-E 

X • B -X-S-X/X' -X-L 

X-B-X-S-X/X'-B-X-l 

X-B-X-C-C-C/D'-B 

X-B-X-C-C-C/V-B 

Z-X-X-X-«/&'-B*l 

X- B~X -C-C-C/D'-E«X-t 

X-S-X-«-X/B'-D'-H 

X- B -X-5-C-C/C ' •© ' -X-t 

X-8-X-G-X/t>'-B'-E-X-L 

X-B-X-C-C-C/X/&'-D 

X - B - X -3 -C-C/X/D ' *X 

X-B-X-G-C-C/X/6'«X-1 

X- B-X-5-C-C/X/S' -X-X-t 

X- 1 • X-G-C -C/X/B ' «&' -1 

X - 8 - 1 -S-C-C/X/B * -C ' -E-l 
X-B-l-G-C-C/X/D'-D'-E-X-L 



tern 



MCCXTCniTCUGTCW^XCUMXa^COTCKKKJXTWAWCCXSwi 

TTcxTOOK^Gxccm^TecnewsxTxepjsewowceeAM^sm 

rxvjtDtsK»s»' (ts . * c . 

xcwwxTcscwcMxicrxewxxt^eA^ACAOtxcwecxcreccTO 

CTGTC?CTCCCTCAAT*C 
L S L t I • 

(SBQ ID 1*0: 154} 



ICTL2 



SHLvxeAXXcxTrcvxecze 

TTCATSCTCAMGACCTmAMTCCCTCAAGATACTO 

rKVrOLSKPSRYLCKCQP^f 

ACTCSACCGASATGTACWAWraSCCCATCAAAGTCCAJ^^ 
TCARCTIKVPKXV QTClJJAt 

CAGCTCTACTAA 
Z L Y • 



(S£Q re HO: 1SS) 



nccu 35 



zcrx.3 



AGCCATCTWTCAAlttttCCAGAGttGGAGAAAALTlU^ 

TTCATGOTGAMGACCTTrCAMTCCCTCAAGATACTrGWCAAGTCCCC^ 
FKVXO LSKtS R V tCKC PK tf 

ACTWTGATWCTGCeAMACTACCTAATWCCJtG^TTCTACA^ 
TODRCQKYVXASf YXAtZ L T 

TAA 



(SEQ IS Ki 156) 



neciz 40 



zcru 



SHLVKeA**** 1 '. 



(SEQ IS MO: 157) 



XCfLS 



• ♦ 

*^CATCrTC7CJUC7S7SCAGXG;W«AGA-U>C 

SKIVXCAI**** FCV,CC CIC 

TTCAraWXAAGXCCTITCJUUtfCCCTcixGX 

FKVK01S1CPS RYLCKCQP CF 

ACraXGWXGXTCTXCTCXGXATGTCCCCATCXMGTCC^ 
TGARCTINVPKKVCTQEKCP 

• • • * • • 

WtfGAGTmCWOTCATCOCWCCAAMCTACGTAATOG^GCTTCT^ 

X £ FTCDRCQK YVXASFYSTS 

XCTCCCmCTCTCTCTOCCTCXXTXC 
TPFLSLP«« 



(SLQ IT HO: 158) 



ncuxi 42 



tcrw 

I L Y « 



(StQ IS *0: 15S) 



f IGUtI *3 



ricuu 44 




e • * * • 



i i i i t n o t u n » « i * » « » 
. osrxx-* 

i * * » * o * i v c » i j i { j ; j • ; 



0372X*t 



1 i i T C t v VI t o X 

Q 0 0 k h D X t k A k k k « X k O & If O 

• • • ? - - • 

_ A * 



(SEQ IT NO: 167) 



ncyai 45 (i ef 3) 



X*CXX*tYl¥XVXCVVAVXA 

XVSQVVX1X 

axrxx-i & «nx-n 



* © t t x v xeAtrott* 

! r i cox i t c l oc t m^n w iri^ i r v>7 rxa ^ Tg c T W Xg oQ oxoag too 

t 7>*CGXLXX&SX?Xr7XX>D 
f ? F V X * YXT?«Stl 

j carn-2 

ax ucx raj c tocc o ^ ICO 

XX4TfXAtJLX7XjL*r>>129 
1 X J # o 

ixxgcrexxuxadLxKoo ^^ lose 



CftXlXXXVftXVltCXac&fcfi 

Ci^xxKXfoxax x.o t x t v z. t c 

t V t % 

osrxx-f 

2?l<X YX5LXrXV/t¥GYXL 

VXXXXtQUX X X QXXtCXtXl 

XXXXAXLASJQXrXCXVXXX 
XAJLAfcfCXTfcXX 

oonx~i2 

tCHDS&XXVXTXVltV*?!? 



FICU££ 45 (2 cf 3) 



• • ♦ - • * 

**TC*IIlV*CA*XIX*fCV 

• * « _ • • • 

*oascrxvx&&*K»«**t>c& 

CIX*J*Ct>*CC*TVXAS*V« 
t I T ? } I I I M 1 

* • • • * • 

# • • « • • 



TICUKI i5 (3 of 3) 



% 125 I— Uridine Incorporation 





cn 


ro 


CO 






cn 


CO 


ro 


CO 


^1 


10 


CO 


3 


0 


CD 


0 




3 


3 




O 


3 


3 




O 


O 


O 


O 


-n 


0 


O 


0 


"n 


*n 


-n 


~n 


CO 


"71 


"T| 


Tl 


CO 


CO 


CO 


CO 




CO 


CO 


CO 


7; 


7s 


7s 


7s 




7s 


7s 


7s 



CPM 




fO © 



o 
o 

B 
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